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Part I 


Information status and NP 
marking 


Chapter 1 


Referential cohesion in Bunun: A 
comparison of two genres 
Rik De Busser 


National Chengchi University 


This chapter investigates how referential expressions are involved in establishing and main- 
taining textual cohesion in Bunun, an Austronesian language of Taiwan, and how this be- 
haviour varies across genres. Relying on a model of referential cohesion inspired by sys- 
temic-functional grammar, it explores differences and similarities for encoding referential 
continuity across sentence boundaries in oral and narrative text. It concludes that, contrary 
to initial expectation, and despite considerable formal differences in how referential expres- 
sions are realized, at a more fundamental level the properties of referential cohesion are 
unexpectedly stable across genres. 


1 Introduction 


1.1 Cohesion 


Now more than four decades ago, Halliday & Hasan (1976) published their seminal work 
on the linguistic subsystem that helps creating coherent text by establishing connec- 
tions between related semantic elements in that text. More specifically, it is “a set of lexi- 
cogrammatical systems that have evolved specifically as a resource for making it possible 
to transcend the boundaries of the clause — that is, the domain of the highest-ranking 
grammatical unit” (Halliday & Matthiessen 2004: 532). They referred to this subsystem 
as cohesion and to the connections as cohesive ties, and described it as the set of “rela- 
tions of meaning that exist within the text, and that define it as text” (Halliday & Hasan 
1976: 4). This means that its realization is not confined by clause or other grammatical 
boundaries, but typically operates on the scale of text or discourse. 

It also implies that there is no isomorphic relationship between grammatical devices 
and cohesive effects; cohesion pertains to semantic relationships within texts that “may 
take any one of various forms” (Halliday & Hasan 1976: 13). The original proposal, which 
has been integrated in Halliday's systemic-functional grammar (Halliday 1994; Halliday 
& Matthiessen 2004; Halliday & Matthiessen 2014), distinguishes four types of relation- 
ships: 
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1. REFERENCE establishes cohesive ties between linguistic elements through various 
forms of spatio-temporal and personal deixis, and through comparison. 


2. ELLIPSIS covers all phenomena that establish cohesive links by omitting a gram- 
matical unit, or by swapping it for a placeholder element.1 


3. CONJUNCTION creates logical or spatio-temporal ties between propositions, typi- 
cally through various grammatical mechanisms for clause linking. 


4. LEXICAL COHESION is established between lexical elements through repetition and 
various semantic relationships. 


The markers for each of the four types of cohesive relationship are indicated in the fol- 
lowing examples. 


(1) English 
a. Reference 
That man's dog is much larger than my cat. 


b. Ellipsis 
How many cookies are left? I took twelve @. So did you. 


c. Conjunction 
When it shut down, something went wrong. In short, it caught fire. 


d. Lexical organization 
Emperor penguins protect their chicks from the cold by putting the little 
fluff balls on their feet. 


In (1a), that points to a referent that exists outside the text (exophoric reference), the 
phrase much larger than connects that man's dog and my cat, and the possessive form my 
creates an exophoric link to the speaker. In (1b), ellipsis in the second clause indicates that 
the head of twelve @ refers to the same set of referents as cookies in the first clause. The 
substitutive construction so did in the third clause indicates that its subject performed the 
same action, take [cookies], as the first person in the second clause. In (1c), when creates 
a relationship of simultaneity between the first and second clause, and in short indicates 
that the third clause summarizes the previous discourse. Finally, the penguin-related 
lexical items in (1d) arrange themselves in a complex of lexical cohesive relationships 
(see Figure 1). 


'Halliday & Hasan (1976: 88-141) called the latter substitution and originally considered it to be a separate 
cohesive category, meant to account for forms like one in English expressions such as You can choose the 
blue candy or the red one. They acknowledged that both substitution and ellipsis established cohesive ties 
by replacement, either by zero or by a placeholder (Halliday & Hasan 1976: 88) and both are subsumed 
under ellipsis in Halliday (1994) and later publications. 
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hyponym metaphor 
Emperor penguins < I. chicks a fluff balls 


part-whole 
feet 


Figure 1: Schema of a cohesive chain 


As the examples illustrate, markers of cohesion are highly heterogeneous in their gram- 
matical and relational properties. What they have in common is that they establish co- 
hesive ties, that is, semantic connections between linguistic elements (words, phrases, 
clauses, etc.) that are typically asymmetrical and express that the discourse segments in 
which they occur are to be interpreted as part of a coherent whole. These ties, either 
in isolation or by combining into longer chains, weave through a text. Together with 
thematic structure (theme/rheme contrasts) and focus structure (given/new), cohesion 
thus creates ‘texture’ (Halliday 1994: 334; Halliday & Matthiessen 2004: 579), the percep- 
tion of a text or discourse as a connected whole. Texture in turn is “one aspect of the 
study of coherence, which can be thought of as the process whereby a reading position 
is naturalized by texts for listener/readers” (Martin 2001: 35). 

Importantly, this implies that Hallidayan systemic-functional grammar “does not 
equate cohesion with coherence” (Martin 1992: 27; see also Martin 2001). Cohesion is 
merely one of the linguistic systems responsible for textual coherence. Later work on 
coherence often merged the two concepts, and typically reduced the phenomenon to 
a semantic-pragmatic component responsible for combining clause-level propositions 
into larger rhetorical structures (see e.g. Mann & Thompson 1987; Kehler 2002; Kehler 
2004). 

In sum, cohesion is an information-structuring device that, by establishing semantic 
connections between a heterogeneous set of linguistic units within a text, assists lan- 
guage users in interpreting that text as a cohesive, connected whole. In doing so, it is 
one of the subsystems responsible for structuring the distribution of information ele- 
ments on a textual (supra-sentential) level. 

Cohesion has been explored extensively in theoretical and applied linguistics, but over- 
whelmingly in the context of English (Halliday & Hasan 1976; Connor 1984; Martin 1992; 
Abadiano 1995; Tanskanen 2006; Crossley & McNamara 2012) and occasionally other 
major languages (Aziz 1988 on Arabic; Hickmann & Hendriks 1999 on English, French, 
German and Mandarin; Kruger 2000 on Afrikaans; Hassel 2005 on English, German and 
Norwegian). Work on minority languages is much less common. In the Austronesian 
world, the only studies I am aware of are Ezard (1978) on Tawala, Flaming (1983) on 
Wandamen, and Benn (1991) on Central Bontoc. The first two are literal applications of 
Halliday & Hasan’s framework to their languages; Benn employs a number of frame- 
works, including Halliday & Hasan’s, for his analysis of the discursive structure of Cen- 
tral Bontoc ritual texts. 
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This chapter adapts Halliday & Hasan's original model to fit the needs of analysing 
the role of referential expressions in establishing the cohesive texture of Bunun texts. It 
investigates the role cohesion plays in establishing genre distinctions through a small- 
scale pilot study. 


1.2 Genre and cohesion 


Genres or registers can be defined as specific types of texts or discourse with sets of 
"relatively stable" properties associated with the “thematic content, style, and composi- 
tional structure" that reflects the specific needs of well-defined contexts in which they 
were realized (Bakhtin 1986: 60). 

Distinctions between genres are marked through various linguistic means. Biber (1995: 
28) makes a basic distinction between register markers and register features. The former 
are linguistic cues that are specific to a certain register or genre and therefore directly 
indicate that a text belongs to it. An anecdotal example is the phrase a long time ago 
in a land far, far away introducing a fairy tale. Register features are linguistic elements 
that are not genre-specific, but whose frequency or distribution is in certain situations 
indicative of a specific register or genre. For instance, imperatives are relatively common 
in recipes, but they occur in many other genres as well. Cohesion falls into the latter 
category. 

Research on the relationship between genres and their indicators has mainly focussed 
on the "relative distributions of surface linguistic features, such as adjectives, nominal- 
izations, passives, and various clause types" (Biber 1995: 12). Even Biber, who went well 
beyond previous studies by focussing on complex feature bundles, mainly concentrates 
on morphosyntactic features that can be straightforwardly extracted from the surface re- 
alization of the text (see Biber 1995: 94—104; also Biber & Conrad 2009: 217-226). Given 
that genre is associated with the global discursive and semantic features of texts, one 
should probably assume that these grammatical features serve as proxy indicators of 
certain structural elements of meaning, discourse organisation and information struc- 
ture. 

Cohesion is an important determinant of the distribution of information in text, so 
it is reasonable to assume that it is interconnected with the global properties of text 
structure, and therefore contributes to (Halliday & Hasan 1976) or closely interacts with 
genre (Martin 1992; Martin 2001). There are a number of reasons why one would expect 
consistent correlations between cohesion and genre, many linked to the accessibility of 
linguistic information (Lambrecht 1994: 74-116; Ariel 1991). 

First, expectation patterns related to the nature and quantity of assumed background 
knowledge and explicitly expressed information are often genre-specific. Specialized 
genres assume a greater volume of background knowledge than more generalized gen- 
res. For example, the presupposed background knowledge in an informal conversation 
is different from that in an academic textbook (Biber & Conrad 2009: 14-15). This affects 
the need for explicitly expressing cohesive relationships between elements in a text. 
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Second, differences in genre commonly correlate with differences in modes of realiza- 
tion, which in term influences the options for realizing cohesion. For instance, oral and 
written genres diverge in which cohesive strategies they employ (see e.g. Fox 1987; Givón 
1993). Textual coherence in writing is partly realized through meta-linguistic means, 
such as writing conventions and punctuation, not available in oral discourse. Because of 
the visual nature of the written medium, information is also more readily, and longer, 
accessible. All things being equal, one would therefore expect that oral genres tend to 
have a more dense cohesive structure (or a larger presence of other coherence-creating 
mechanisms) than their written equivalents, in order to reach an equal level of coher- 
ence. 

For certain types of cohesive relationships, the link between cohesion and genre is 
well understood. For instance, it is uncontroversial that “genre-specific conventions [...] 
play a significant role in anaphoric patterning in conversation and writing” (Fox 1987: 
2). Research explicitly comparing cohesive patterning across genres is scarce, but the 
influence of cohesion on the realization of individual genres is the subject of a number of 
studies. The above-mentioned Benn (1991) investigates cohesion in single genre (written 
essays) in Central Bontoc. Another example are Malah & Rashid (2016) who, based on 
Hoey (1991), explore the role of content words in establishing the cohesive properties of 
English language Nigerian newspaper texts. 

One important question is in which manner exactly cohesion indicates genre in text 
or - from a comparative perspective — how its realizations are indicative of differences 
between genres. Halliday & Hasan suggest that "the possible differences among different 
genres and different authors [are] in the numbers and kinds of tie they typically employ” 
(Halliday & Hasan 1976: 4). In other words, one would expect that (1) cohesive density, 
and (2) the nature of the connections between elements in cohesive relationships varies 
between genres. 

This chapter investigates whether, and to what extent, these two hypotheses are true 
for two text genres, oral narrative and biblical translations, in a Bunun speech commu- 
nity. It compares the cohesive density and the morphosyntactic and semantic-pragmatic 
properties of cohesive ties in these two types of texts. 

Similar to Malah & Rashid (2016), this chapter not discuss all aspects of cohesion as 
they were introduced in Halliday & Hasan (1976). Rather, the discussion focuses on ref- 
erential cohesion, the conceptually coherent subset of cohesive ties that is involved in 
establishing relationships between referential items. Its exact delineation is discussed 
in $1.4. Before this is possible, I first introduce the Bunun language, its dialects and the 
genres involved in the present analysis. 


1.3 Bunun dialects 


Bunun is one of around sixteen Austronesian languages spoken on Taiwan (Li 2008). It 
has five extant dialects that are classified into a Southern (Isbukun), Central (Takbanuaz 
and Takivatan) and Northern branch (Takibakha and Takituduh). Within the Isbukun di- 
alect, at least three distinct varieties are spoken in Kaohsiung, Taitung, and Nantou. Be- 
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tween dialects, especially between Isbukun varieties and dialects of the other branches, 
there is a fair amount of phonological, lexical, and grammatical differentiation (see Li 
1988 for an overview of phonological and lexical variation). Only the Takivatan and Is- 
bukun dialects are relevant to the present discussion. 


(2) Bunun 


a. Takivatan (fieldwork, observed) 
mun-fisaq-?as 
ALL-where-2s.SUBJ 
"Where are you going?' 

b. Isbukun (Lilian Li, pers. comm.) 
ku-?isa? kasu? ma-tuktuk lukis 
ALL-where 28.sUBJ DYN-chop wood 


"Where do/did you go to chop wood?' 


Example (2a-2b) illustrates the degree of discrepancy between the two dialects.? The 
coda of the question word (/q/ in Takivatan, /?/ in Isbukun) is illustrative of a systematic 
phonological contrast. In near-identical contexts, both dialects use different allative pre- 
fixes. Finally, whereas Takivatan prefers a pronominal clitic in subject positions like this, 
Isbukun uses a free pronoun that does not exist in Takivatan (see Table 1 and Table 2). 

Bunun dialects have a verb-initial constituent order and what has been called a West- 
ern Austronesian or Philippine-type voice system (see French 1987; Foley 2007; Riesberg 
2014 for general overviews), which in Bunun distinguishes at minimum between actor 
(Av), undergoer (uv), and locative voice (Lv), marked by suffixes on the verb. In (3a), sida 
is actor voice and as a result unmarked; the vv in (3b) is indicated by a suffix -un, and 
the rv in (3c) by -an. 


(3) Takivatan Bunun 

a. Actor voice ((fieldwork, elicited)) 
na-sida qaimansud. 
IRR-take thing 
‘I will pick up things? 

b. Undergoer voice ((fieldwork, text corpus)) 
manman ni sida-un. 
many NEG take-uv 
^... a lot were not caught? 


c. Locative voice ((fieldwork, text corpus)) 
maqtu pa-sida-an-in nabul vanis. 
can ` CAUS.DYN-take-Lv-PRv deer wild.boar 


*.. and we could catch deer and wild boar? 


?The following changes were made to graphemic conventions: z > ó, ' > 2, ch > dz ng > n 
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Certain analyses additionally include instrumental, beneficiary, and resultative object 
voices, but these forms are relatively uncommon and can be further ignored here. 

The remainder of this section gives a short overview of various deictic paradigms, 
since these are relevant to the discussion at hand. All five Bunun dialects have sets of free 
and bound personal pronouns. Paradigmatic distinctions are largely equivalent, but the 
pronominal sets have formally diverged and have been analysed as expressing different 
grammatical distinctions in Takivatan and Isbukun. Tables 1 and 2 give the pronominal 
paradigms for both dialects. 


Table 1: Personal pronouns in Takivatan Bunun 


(a) Subject Non-subj. Poss. Loc. 
Free Bound Free Bound 
Hak ainak 
1S sak, saikin  -(?)ak ðaku, nak  -(?)uk Tk AO óaku?an 
28 — Cas su?u, su — isu, su su?u?an 
3S (see b) -(2)is (see b) — 
11 fata, in?ata — mita — imita mita?an 
1E óamu,sam -(?7)am  óami nam — inam, nam Óami?an 
2P amu -(?)am | mu?umu — imu, mu mu?u?an 
3P (see b) — (see b) — 
(b) Subject & Non-subject 
Prox MED DisT 
3s isti istun ista 
3P inti intun inta 


Some of the more systematic differences are worth mentioning. Third person pronouns 
in Takivatan differentiate between proximal, medial and distal forms and do not have 
distinct subject and non-subject forms. They can therefore be interpreted as a subset of 
demonstratives (Table 3). In contrast, Isbukun third person pronouns do not encode a 
deictic contrast. Singular forms all appear derived from the stem sia, which in Takivatan 
is an anaphoric form that appears in a number of grammatical positions (De Busser 
2009: 467—474). Plural forms all derive from the stem nai. Zeitoun (2000: 72) suggests 


*The Takivatan data on personal pronouns is from De Busser (2009: 441); the Isbukun data from the Kaohsi- 
ung variety in Huang & Shih (2016: 85). The latter mark vowel length by grapheme doubling. This distinc- 
tion is non-phonemic in Bunun: generally, monosyllabic roots tend to have lengthened vowels, irrespective 
of the environment in which they occur. To make comparison easier, long vowels in the Isbukun exam- 
ples are represented by single vowel graphemes. Subject and non-subject forms are analysed and glossed 
differently in De Busser (2009) and Huang & Shih (2016), in this might reflect subtle differences in the 
grammatical distribution of these forms. Again, to make comparison easier, this terminology has here 
been homogenized. 


Rik De Busser 


Table 2: Personal pronouns in Isbukun Bunun 


Subject Non-subj. Poss. Loc. 
Free Bound Free Bound 
1S saikin -ik óaku -ku inak dakuan 
2S kasu(n) -as su -su isu suan 
38 saia, sai(n) — saiia — isaidgia, isia ^ sifaan dia 
1I kata -ta ita, mita -ta imita mitaan 
1E kaimin -im óami — inam damian 
2P kamu(n) -am mu -mu imu muan 
nai, nian nai (vis), inai (vis), alan daa 
3P (vis), naia — naid;ia — inaidgia (vis) 
(NVIS) (NVIS) (NVIS) 


that variant forms within each category code a visibility distinction. The element dia on 
third person forms is in all likelihood a distal determiner enclitic, making their status of 
as personal pronouns contentious. 

Demonstrative pronouns vary substantially between dialects. De Busser (2017: 95-97) 
describes an elaborate paradigm for Takivatan; see Table 3. 


Table 3: Free demonstratives in Takivatan Bunun 


MED 


PROX DIST USPEC 
S VIS aipi aipun aipa aip 
NVIS naipi naipun naipa naip 
P VIS ainki ainkun ainka = 
NVIS nainki nainkun nainka = 
GNR VIS aiti aitun aita — 
NVIS naiti naitun naita — 
PAUC VIS = — (ainta) = 
NVIS = naintun (nainta) — 


None of these forms has so far been attested in Isbukun. The demonstrative forms de- 
scribed in Huang & Shih (2016: 95) distinguish case and distance, but not visibility. How- 
ever, their paradigm consists of fully transparent combinations of the form sia or the 
spatial adverbs di and adi ‘there’ with various bound determiners (see Table 4), which 
encode both distinctions mentioned above. 

Finally, all Bunun dialects have two sets of bound determiners, which encode a dis- 
tance contrast and can occur on a variety of word classes including verbs (see De Busser 
2009: 427-440 for an explanation). 


10 


1 Referential cohesion in Bunun: A comparison of two genres 


Table 4: Determiners in Isbukun and Takivatan Bunun 


Takivatan Isbukun (Taitung & Kaohsiung) Isbukun (Nantou) 
D1 PROX -ki -in -in 
MED -kun -an -an 
DIST -ka -a -a 
D2 PROX -ti -djn -tin 
MED -tun -tan -tan 
DIST -ta -dia -tia 


Again, there are formal differences, this time even between different varieties of the 
Isbukun dialect. The distinction between n1 and p2 appears to be fundamentally different 
in the two dialects. De Busser (2009: 426-440); De Busser (2017) argues that in Takivatan 
the difference between the two sets is semantic in nature; in Isbukun, the difference is 
associated with case (D1 = NOM, D2 = OBL; see Huang & Shih 2016: 95; Zeitoun 2000: 76). 
Bound determiners are optional, and are considerably more common in Takivatan than 
in other dialects. 

The paradigms above serve as illustrations of the degree of differentiation between 
Takivatan and Isbukun, and give an overview of some of the paradigms that are relevant 
to the present analysis, as they directly influence the difference between different text 
genres in Bunun. 


1.4 Narrative genres in Bunun 


This chapter compares two narrative genres, traditional oral narrative text and biblical 
narrative, as they occur in a single Takivatan Bunun language community in the village 
of Bahuan at the East Coast of Taiwan. 

Traditional narrative texts, despite being transmitted orally, are by nature not impro- 
vised. In traditional Bunun communities, both expository and narrative texts in formal 
settings follow relatively strict conventions that govern amongst other things: (1) who 
has the right to speak and when; (2) how certain traditional knowledge should be pre- 
sented; and (3) which formal aspects, such as formulae related to politeness and the 
veracity of the narration, should be included in specific oral genres. Many of these con- 
ventionalized aspects of stories appear to be the result of an ongoing consultation process 
between the elders, or a larger group of members, of the community. Transgressions of 
these rules are usually pointed out by authoritative members of the community, usually 
male elders. 

A second genre with which many Bunun people are confronted on a nearly daily basis 
are biblical narratives. Presbyterian and Catholic missionaries introduced Christianity 
after the end of the Second World War, and it is an important part of contemporary 
Bunun culture. From the 1940s onwards, especially the Presbyterian Church, through 
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the Bible Society of Taiwan, has been active in translating biblical texts into the Bunun 
language. 

Bible translations are intricate undertakings that typically involve a translation team 
consisting of translators, native language consultants, and theologians. Especially in cul- 
tures that have little historical affinity with the Judaeo-Christian tradition, this process 
is more than simply translating texts: it requires the meticulous transposition of an 
alien conceptual universe with its associated lexical and grammatical framework (see 
De Busser 2013). This makes it nearly impossible to produce translations for every di- 
alect of Bunun. 

The present Bunun Bible translation (Bible Society in Taiwan 2000) will be referred to 
as the Bunun Bible. It is the authoritative translation containing the full New Testament 
and an abbreviated Old Testament, and is heavily based on the Isbukun dialect. Despite 
this, it is used in almost all Bunun churches, irrespective of their denomination or the 
dialect area to which they belong. From a language planning perspective, this made a lot 
of sense: Isbukun is the largest dialect, has the widest geographical spread, and has been 
studied most extensively. This is not to say that the Bunun Bible is a written mirror of 
any specific Isbukun variety: the translation process rather resulted in a supra-dialectal 
written standard for Christian texts in Bunun, which is also used for other religious text 
genres such as hymns. 

An interesting consequence is that, although many Bunun are reading or listening to 
excerpts of the Bible on a regular basis, especially “to members of Takbanuaz, Takivatan, 
Takibakha and Takituduh communities, the language of their Bible is not closely related 
to the common vernacular” (De Busser 2013: 67). Since the discrepancy between Isbukun 
and the Takivatan dialect is considerable, the result is a situation in which two dominant 
narrative genres in the Takivatan language community have relatively divergent dialec- 
tal characteristics. 

This leads to a question: how and to what extent do language users in non-Isbukun 
communities interpret the content of these texts that belong to related genres but have 
quite distinct lexical and grammatical characteristics? To an extent, this is a matter of 
lexical and grammatical overlap between dialects. However, an additional question con- 
cerns the cohesive fabric of these two types of texts. Given the differences between 
grammatical paradigms that are central to establishing cohesion, such as personal pro- 
nouns, demonstratives, and determiners, how do language users keep tab of issues such 
as thematic integrity, topic continuity, and the general distribution of information in a 
narrative progression? 

The present study will investigate this issue by looking at how these two different 
narrative genres realize cohesion, and in a particular subset of cohesive relations that is 
here referred to as referential cohesion. 


1.5 Referential cohesion 


The basic framework for my analysis of referential cohesion is set out in De Busser (2017). 
Halliday & Hasan (1976) consider cohesion to be the aggregate set of cohesive ties, se- 
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mantic relationships that exist between meaningful elements in a text. Cohesive ties are 
directional: they point from a textual source, which will be called the Reference (Rc), to 
a second element, which can exist inside (4a) or outside the text (4b). 


(4) a. Rc points to text-external Rt 
maq aipa Rc [— Rt: object in external reality] 
what DEM.S.DIST.VIS 
“What is that?’ 


b. Rc points to text-internal Ta 
maq-a ainak-a tama-ka ` tu-tuóa tu miqóiq dair?aó 
DEFIN-LNK 1S.POSS-LNK father-K.DIST INTENS-real coMpL difficult large 
aipa. Ta — Rc 
DEM.S.DIST.VIS 


‘As for my father, he really had a lot of difficulties’ 


When this element is linguistic in nature, I call it a Target (Ta). Targets are themselves 
References that point back to previous targets. In doing so, they create cohesive chains, 
referential strands of different length that ^weave' through a text. Eventually, the final 
Reference of each chain points to a concept that exists outside the textual universe; this is 
called the Referent (Rt) of the cohesive chain and is in effect its ultimate Target (Halliday 
& Hasan 1976: 329; De Busser 2017: 107-108). This is schematised in Figure 2. 


Figure 2: Schema of a cohesive chain 


Together with other linguistic mechanisms, such as prosody, event expression, and con- 
textual information, cohesion allows language users to interpret a text as a coherent 
whole operating in a context. It does this by expressing "the continuity that exists be- 
tween one part of the text and another" (Halliday & Hasan 1976: 299). In doing so, cohe- 
sion forms an interface between the local and global distribution of information elements 
in a text. This interaction is given to be complex, but one specific example in which ref- 
erential cohesion interacts with clause-internal information structural devices is by the 
"Theme tell[ing] the hearer where to start from in the interpretation of a message, and 
the cohesive ties signal how the message latches on to other parts of the discourse" (Has- 
selgard 2004: 68). 
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A crucial aspect of Halliday & Hasan” s (1976) model is that cohesive ties are inherently 
semantically motivated, and therefore do not correspond to a single grammatical mech- 
anism or exist between fixed classes of words or other linguistic elements. Somewhat 
contradictorily, Halliday & Hasan partly rely on grammatical criteria to distinguish dif- 
ferent types of cohesion: they make a basic distinction between grammatical cohesion, 
which is expressed by grammatical means such as function words and grammatical con- 
structions, and lexical cohesion, which is expressed through content words. These two 
categories further break down in an assortment of subtypes, based on a combination of 
semantic and grammatical criteria (see Halliday & Hasan 1976: 324). 

This chapter takes a different approach; it focuses exclusively on referential cohesion, 
"the set of cohesive relations that create referring relationships between linguistic forms 
and referents" (De Busser 2017: 107). These are all linguistic expressions that can be tar- 
geted by deixis (or simply, that can be pointed at). In essence, this combines Halliday 
& Hasan's category of reference, with the exclusion of comparative reference, and their 
category of lexical cohesion, with the exclusion of collocation. 

The criterion for establishing referential cohesion is semantic: all words and sub-lex- 
ical elements that are involved in establishing referential meaning are included in the 
cohesion analysis irrespective of their word class or grammatical status. Their involve- 
ment in reference is determined by their ability to be indicated by deictic expressions. 
The only formal restriction is that (with the exception of lexical compounds) phrases and 
other multi-word units are excluded, to prevent the data selection process from becom- 
ing too arbitrary. Word classes that have so far been implicated in referential cohesion in 
Bunun are: (1) nouns; (2) personal pronouns (see Table 1 and Table 2); (3) demonstrative 
pronouns (see Table 3); (4) bound demonstrative articles (see Table 4); (5) the anaphoric 
marker sia and its derivations; (6) numerals; (7) words expressing time, manner and lo- 
cation; (8) question words; and (9) certain verbal roots. 

It is appealing to equate referential cohesion to anaphora resolution, but this is only 
so in a very broad sense. Phoric reference is typically seen as a grammatical property 
of language that is involved in referent tracking and uses fixed morphosyntactic strate- 
gies to establish relationships of identity between expressions and their antecedents (see 
for instance Huang 2000). On the other hand, referential cohesion, though obviously in- 
volved in reference tracking, is a semantic property of language that creates meaning 
relations between two referential expressions. These are not always relations of identity 
(although they can be), and neither do they necessarily have straightforward morphosyn- 
tactic correlates. For instance, the cohesive tie between bantas ‘legs’ and the preceding 
word bunun ‘man’ is meronymic in nature, and the fact that both are nouns is not gram- 
matically determined. 


(5  T3.8a 
itu bunun-d;ia bantas mas buhtur hai insu 
this.here people-pisT.oBr leg OBL joint TPC immediately 


is-tamasaó-an 
TRANSFER-strength-Lv 


‘This man here, his legs and joints immediately became powerful, ..: 
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The fact that cohesion is involved in shaping the general structure of a text suggests that 
it varies between and is therefore indicative of genre (Martin 2001). One of the goals of 
this research is to try to establish the nature of this variation. One possibility suggested 
by Halliday & Hasan (1976: 4) is that "among different genres and different authors in 
the numbers and kinds of tie they typically employ” Another option is that genres vary 
in cohesive density, the number of cohesive elements or cohesive chains relative to text 
length. I will investigate both possibilities in the next section by analysing three short 
text segments. 


2 Cohesion analysis 


2.1 Data 


The present study is a small-scale comparison of oral and biblical narrative as it occurs 
in the Takivatan Bunun speech community in the village of Bahuan (Chinese name: 
Mayuan) at the East Coast of Taiwan. It consists of the analysis of three text excerpts 
(T1, T2, and T3), which are given in their entirety with cohesive elements underlined in 
the Appendices. All three are part of larger narratives; segment boundaries were chosen 
to extract internally coherent sub-narratives. 

The first two texts, T1 and T2, are traditional oral narrative sequences. They feature 
two elderly men, both fluent speakers of Takivatan Bunun and both around 75 years old 
at the time of recording. Text T1 is an account of a hunt and is part of a long story in 
which the speaker, Vau Taisnunan, recounts his life story. In text T2, Tulbus Manququ 
recounts how the traditional Bunun hunters used prophetic dreams to determine the 
appropriate time for the hunt. 

Text T3 is an excerpt from the Acts of the Apostles in the Bunun Bible (Bible Society in 
Taiwan 2000), in which the apostles Peter and Paul heal a cripple. As mentioned in 81.2, 
itisa written text that is the product of a complex translation process. The spelling of the 
original text in the Bunun Bible was adapted to make it consistent with the Takivatan 
Bunun texts and make it adhere to a one-grapheme-per-phoneme principle (see Footnote 


1). 


2.2 Methodology 


All elements in these texts that could be unambiguously identified as having a referential 
function were marked for analysis. Importantly, this means that non-expressed (ellipted) 
elements are not included, despite having a referential value. In contrast with the coding 
scheme in Halliday & Hasan (1976: 329—355), no prior assumption is made about the word 
class (or morphological class) of the elements involved; so far only the nine linguistic 
classes mentioned in §1.5 have been attested in referential cohesive relationships. 

In the data set the following information about each Reference, and the nature of its 
cohesive tie to its Target and Referent are encoded: 


1. The location of a Reference in the text; 
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2. Its word class (see 81.5 and Table 5); 


3. Its Target and the location of the Target in the text (this is not relevant for ex- 
ophoric links); 


4. The Referent of its cohesive chain, i.e. the text-external entity (or event) to which 
the Reference eventually refers; 


5. Theontologicaltype ofthe Target, i.e. whether the immediate Target of a Reference 
refers an event, a location or time, or a textual element (see Table 6); 


6. The phoric status of the cohesive tie, i.e. whether it is a anaphoric, cataphoric, or 
exophoric link (see Table 7); 


7. The relationship between the concept indicated by a Reference and the concept 
indicated by its Target (see Table 8). 


The first four data points provide information about the structural properties of cohesive 
chains; the information in 5-7 relates to conceptual and informational properties of indi- 
vidual cohesive ties. The Referent (data point 4) of referential expressions is indicated by 
unique names that allow us to track cohesive chains. Targets of cohesive ties can belong 
to a number of distinct ontological types (data point 5). Most commonly they are con- 
crete or abstract entities in the real world, but they can also be reified events, physical or 
temporal locations, or textual elements; this is discussed in $3.2. The phoric status (data 
point 6) indicates whether a Reference points to a Target that precedes it (anaphoric ref- 
erence), follows it (cataphoric reference) or exists outside the textual universe (exophoric 
reference). 

Finally, data point 7 encodes the conceptual relationships between References and 
their Targets. Possible values are adapted from the set of relations subsumed under 
Halliday & Hasan (1976: 277-282) category of reiteration, complemented by Peirce's 
fundamental semiotic relationships that exist between signifiers and semiotic objects 
(metaphor, metonym, symbol; see Merrell 2001). The following relationships are distin- 


guished: 


Identity: Relations in which the Reference refers to the same concept as its Target. This 
can be because it is a literal repetition, a personal or demonstrative deictic, a syn- 
onym, or a near-synonym. 


Hyponym/hyperonymico-hyponym: Relations dictated in terms of category membership. 
Hyponyms refer to other concepts of which they are a subclass; hyperonyms refer 
to concepts of which they are a superclass; co-hyponyms are terms that have the 
same immediate superclass. 


Part/whole/co-part: Relations that defined in terms of meronymy (see example 4 above). 


Antonym: Relations based on conceptual opposition. 
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Metaphor: Relations based on similarity, other than identity. 
Metonym: Relations based on proximity or, more generally, contiguity. 


Symbol: Relations based on conventional semantic connections that cannot be reduced 
to any of the previous six relations. 


Originally envisaged to be applicable to lexical cohesion alone, these relations here ex- 
tend to all referential cohesive ties. In the present sample, no metaphoric and symbolic 
relations have been attested. The referential cohesion analysis of the sample texts is in- 
cluded in the Appendix. 

The next section compares the distribution of these data in the three text samples in 
order to investigate the following questions: 


1. How similar or different are oral (T1 & T2) and biblical narrative (T3) in terms of 
referential cohesion? 


2. How do systematic differences manifest themselves? 


3. Given that biblical texts are strongly influenced by the Isbukun dialect, to what 
extent are differences the result of dialect differentiation and to what extent of 
genre differentiation? 


On a more fundamental level, these questions provide an insight in how Takivatan speak- 
ers deal with the genre innovation that biblical narrative has brought to their literary 
repertoire. 

$3.1 discusses the distribution of word classes in the three texts (data point 2), and 
83.2 that of various functional properties (data points 5-7). In 83.3, I look at the global 
properties of cohesion in oral and biblical narrative text. Note again that this exploratory 
study uses a relatively small text sample. 


3 Discussion 


3.1 Word class 


Let's first have a look at how different word classes are involved in the expression of 
referential cohesion. Table 5 shows the distribution of word classes of References in 
the three text excerpts used in the present analysis (please take into account that these 
results are indicative only). 

In line with expectation, nouns are the dominant word class by a considerable margin 
in all texts and both text types: referential cohesion prototypically involves reference 
to concrete or abstract entities, and cross-linguistically these are typically expressed by 
nouns. However, the relative proportion of nouns is significantly higher in biblical text 
than it is in oral narratives (44.59% in T3 vs. 25.45% in T1+T2). In oral narratives, this 
relative scarcity of nouns is offset by a relative abundance of place and time words. In 
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Table 5: Word class of References (Rc) of cohesive ties 


Oral narrative Biblical narr. 
T1 T2 T1+T2 T3 
% # % # % # % # 

anaph. marker 10.71 3 3.70 1 7.27 4 5.41 4 
article 14.29 4 14.81 4 14.55 8 4.05 3 
dem. pronoun 10.71 3 0 0 5.45 3 0 0 
place word 10.71 3 14.81 4 12.73 7 2.70 2 
manner word 0 0 0 0 0 0 0 0 
noun 21.43 6 29.63 8 25.45 14 44.59 33 
numeral 3.57 1 0 0 1.82 1 2.70 2 
pers. pronoun 3.57 1 3.70 1 3.64 2 29.73 22 
time word 14.29 4 7.41 2 10.91 6 1.35 1 
question word 0 0 0 0 0 0 1.35 1 
verb 10.71 3 25.93 7 18.18 10 8.11 6 


TOTAL 100 28 100 27 100 55 100 74 


the absence of any indication that these word classes behave differently in the Takivatan 
and the Isbukun dialect, the most likely explanation for these discrepancies is that it is 
a genre distinction. In oral narratives, especially when they concern historical accounts 
of a personally experienced past, the temporal and geographical anchoring of events is 
probably more important than in stories of a distant past that are mainly intended as 
moral lessons. Conversely, Biblical narrative often puts great emphasis on the symbolic 
significance of names and places; more than half of the nouns in T3 are proper names. 
This is evident when the distributions of ontological types of Targets are compared in 
both genres (Table 6 below). 

A second categorial inversion between the two genres can be observed in the distri- 
bution of demonstrative (anaphoric markers, determiners, and demonstratives) and per- 
sonal deixis (personal pronouns). In oral narratives, demonstrative reference accounts 
for 27.27% of all referential expressions, and person pronouns for a mere 3.64%. In the bib- 
lical excerpt, personal pronouns make up 29.73% of all References, and the three demon- 
strative classes combined only 9.46%. It is not clear how this discrepancy can be explained 
as a genre distinction: oral narratives represent a highly speaker-centric form of story- 
telling and one would assume a relatively high proportion of personal pronominal refer- 
ence. In this case, the difference is more likely due to dialect variation. For instance, as 
suggested in §1.3, bound determiners are much (the data suggests three times) more com- 
mon in Takivatan than in other dialects including Isbukun. We also saw that, whereas 
Takivatan has a highly developed free demonstrative paradigm, putative demonstratives 
in Isbukun are all complex forms involving deictic determiners. Conversely, the Isbukun 
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pronominal paradigm is more complex than that in Takivatan. In the sample, the most 
highly developed deictic paradigm also has the highest relative frequency in each lan- 
guage variety. 

Interestingly, verbs sometimes express referential cohesion. This happens most com- 
monly with verbs derived from nouns or locative words. 


(6) T2.1b 
tanus-an mati-bahi. 
first PROG-have.prophetic.dream 


^... they interpreted a prophetic dream beforehand’ 


(7 T12b 
mina-ba?av tupa naip tu: 
ABL-high.location say DEM.S.NVIS COMPL 


^... Tiang had returned, he had come back from the mountain and told us: ..’ 


3.2 Conceptual dimensions 


This section investigates the distribution of referential cohesive elements in terms of 
their conceptual-semantic properties (data points 5-7 in 82.2). It first looks at the onto- 
logical type of the Target. As mentioned above, referential expressions indicate concepts 
that can be targeted by deixis. One would assume that the prototypical Target of a ref- 
erential cohesive expression is a material entity of some sort. Table 6 shows that this is 
not always the case. 


Table 6: Ontological type of the Target of the cohesive tie 


Oral narrative Biblical narr. 
T1 T2 T1«T2 T3 
% # % # % # % # 
Entity 39.29 11 29.63 8 34.55 19 78.38 58 
Event = = 18.52 5 9.09 5 1.35 1 
Location / Time 60.71 17 48.15 13 54.55 30 16.22 12 
Text = E: 3.70 1 1.82 1 4.05 3 


TOTAL 100 28 100 27 100 55 100 74 


In biblical narrative (T3) entities indeed make up more than two-thirds of the Targets of 
cohesive reference, the majority unsurprisingly people or concrete objects in the narra- 
tive world, e.g. Pitilu ‘Peter’ (T3.1c), naid;ia ‘they < Peter and John’ (T3.3c) or kim ‘gold’ 
(T3.6b). However, in the oral narrative sample this is only one-third. More than half 
of Targets in T1 and T2 refer to a spatial or temporal location, such as laqaiban ‘route’ 
(T1.3b), 7ita ‘there (distal) (T2.2a) or [tupa]-ka '[say] at that time’ (T2.4g). In the previous 
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section, I already explained that this discrepancy is genre-related. Traditional oral narra- 
tives in Bunun culture are typically anchored in the immediate spatio-temporal context; 
in Biblical stories, on the other hand, identifying time and place is only of secondary 
importance relative to the need to keep track of people and objects that populate an 
unfamiliar narrative universe and commonly have a symbolic significance. 

Counterintuitively, 9.09% of referential expressions in T1 and T2 and 1.35% in T3 refer 
to events. These generally are instances of event reification: events are reinterpreted 
as quantifiable objects, with a certain materiality and well-defined boundaries (Quine 
1985; Zacks & Tversky 2001). Finally, a number of referential cohesive ties have a meta- 
textual function: rather than referring to anyone or anything in the narrative universe, 
they point towards part of the text itself. This type of cohesive tie corresponds to what 
Himmelmann (1996) and others refer to as discourse deixis. In Bunun dialects, these 
discourse deictic links are always expressed by sia, which in Takivatan, and possibly 
also in other dialects, is a specialized anaphoric marker and typically refers back to a 
phrase, clause or larger text segment in the immediate context (for shorter segments, 
typically the preceding sentence). For instance, in T3.3b the form sia in sia masaninsin 
pisvanduan ‘the aforementioned Holy Temple’ refers back to an identical phrase in the 
previous sentence T3.2d, which in turn refers back to masaninsin pisvanduan ‘the Holy 
Temple' (T3.1c), the first mention of this particular Referent in the story. 


Table 7: Phoric function of the cohesive tie 


Oral narrative Biblical narr. 


T1 T2 T1+T2 T3 
% # % # % # % # 


Exophoric 10.71 3 25.93 7 18.18 10 16.22 12 
Anaphoric 89.20 25 74.07 20 8182 45 79.73 59 
Cataphoric 4.05 3 


TOTAL 100 28 100 27 100 55 100 74 


Table 7 gives an overview of the distribution of phoric functions of the cohesive ties in 
the sample. Anaphoric reference is dominant in all genres: most referents central to the 
text are introduced near the beginning and tend to persist throughout the story. This 
also explains why exophora are less common: they often occur towards the front of the 
text. Cataphoric reference is rare and in the present sample is only attested in biblical 
narrative. 

Finally, Table 8 gives a breakdown of the types of conceptual relationships between 
References and their Targets.* It is important to realize that these relationships are con- 
ceptual rather than lexical semantic distinctions: they hold between the concepts indi- 


"Totals in Table 8 do not add up to 100% because exophoric cohesive ties have no associated conceptual 
relationship. 
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cated by referential expressions, and not only lexemes, as is the case in Halliday & Hasan 
(1976). This makes it possible, for instance, to establish a part-whole relationship between 
the noun ?ima ‘hand’ (T3.7a) and the pronoun isaidzia '3s.Poss' (T3.7a). 


Table 8: Cohesive relationship between Rc and Ta 


Oral narrative Biblical narr. 
T1 T2 T1+T2 T3 
76 # % # % # % # 

Identity 67.86 19 40.74 1 54.55 30 55.46 41 
Hyponym 7.14 2 11.11 3 9.09 5 = = 
Hyperonym = = 741 2 364 2 = = 
Cohyponym 2.70 2 
Part = = 3.70 1 1.82 1 8.11 6 
Whole 3.57 1 ES = 1.82 1 2.70 2 
Copart 1.35 1 
Antonym 7.14 2 = = 3.64 2 = = 
Metaphor - 
Metonym 7.14 2 7.40 2 7.27 4 9.46 7 
Symbol - 
TOTAL 92.86 26 70.37 19 81.82 45 83.78 59 


The introduction mentioned that referential cohesion is not necessarily identifica- 
tional and is therefore not exclusively “concerned with resources for tracking partici- 
pants in discourse” (Martin 2001: 38). However, from the data it is clear that this is an 
important aspect of cohesion: in both text genres, around 55% of all cohesive ties estab- 
lish relationships of identity, and their function relates to reference tracking. Among the 
general semiotic relationships (metaphor, metonym, symbol), only metonymy is attested 
in the sample. 

One possible minor difference between genres is that oral narrative appears to prefer 
hyponymic relationships, and biblical texts meronymy. However, this is in all likelihood 
an incidental difference resulting from the biblical story having as its main theme the 
miraculous healing of a physical handicap. Superficially, differences between oral and 
Biblical narratives appear almost non-existent, contrary to Halliday & Hasan’s (1976: 4) 
expectation that genres differ in the types of cohesive ties they employ. 

In conclusion, despite differences between the frequency distribution of word classes 
in the two genres (see §3.1), and despite the fact that they have their origins in different 
dialects of Bunun, oral and biblical narratives are largely similar in terms of the relative 
distribution of phoric properties and types of cohesive ties. The most conspicuous differ- 
ence between the two genres is in the ontological type of the Target: cohesive ties in oral 
narratives have a higher tendency to refer to location or time, biblical narrative tends to 
refer more to material entities. 
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3.3 Global properties 


The final section of this discussion examines the global properties of referential cohe- 
sion in the three Bunun text samples. As mentioned, it has been asserted that one of the 
ways in which cohesion might exhibit genre-dependent variation is through consistent 
differences in its density. In other words, the "number and density of such networks is 
one of the factors which gives to any text its particular flavour or texture" (Halliday & 
Hasan 1976: 52). Biber (1995: 187-193) suggests that this is indeed the case for Korean: 
the degree to which cohesive relations, including referential cohesion, are explicitly ex- 
pressed varies widely between text genres. In this study, I measure density in Bunun text 
in three different ways: 


Referential density: The total number of words relative to the total number of References 
(or cohesive ties) in a text.? Referential density gives a general impression of how 
much real estate cohesive referential expressions take up in a text. Note that it 
does not really measure which percentage of words are referential expressions, 
since References can be morphemes and a single word can therefore contain more 
than a single Reference (see e.g. daida-ki 'there-K.PRox' in T1.2b). 


Cohesive density: The number of cohesive chains in a text relative to the total number of 
words. This is a proxy indicator of what in the quote by Halliday & Hasan above 
is referred to as the density of the cohesive network, that is, how many cohesive 
chains weave themselves through a text of a normalized length. 


Cohesive referential density: The number of cohesive chains in a text relative to the num- 
ber of References. This measure indicates the average length of cohesive chains in 
a text, in terms of its average number of referential expressions. 


Table 9 calculates these three density metrics for the three texts and the two genres in 
the present sample. 


Table 9: Global properties of the text segments 


Oral Biblical 
T1 T2 T1«T2 T3 

# of words 62 80 142 179 
# of referential expressions (Rc) 28 27 55 74 
# of cohesive chains 7 9 16 19 
Referential density (words / Rc) 2.214 2.963 2.582 2.419 
Cohesive density (chains / words) 0.113 0.113 0.113 0.106 
Referential cohesive density (Rc / chains) 4 3 3.438 3.895 


This measure is equivalent to Abadiano's (1995: 308) cohesive density. 
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While referential density and referential cohesive density both seem to be vacillating 
around a central value, the most surprising result is that the value for cohesive density 
is almost completely equal (0.11) across texts and genres. Especially in a small sample, 
where a certain degree of instability is expected, it is not very likely that this is a spuri- 
ous result. This is very much against initial expectation, as cohesive density is one of the 
factors that one would most expect to vary across text types. For instance, in planned 
written text, such as our biblical narrative, tracking entities and spatio-temporal loca- 
tions is cognitively less demanding than in oral narration, where visual cues that allow 
the listener to reaffirm the status of activated concepts are not available. The basic as- 
sumption would therefore be that written narration does not need to be as cohesively 
dense as oral narration. 

Not only is this not the case, the present sample suggests that cohesive density is a 
constant, at least in Bunun. This is the opposite of "the possibility of cross-linguistic 
universals governing the patterns of discourse variation across registers and text types" 
that Biber (1995: 359) is looking for: what we have here is a property of the supra-clausal 
information structure of language that appears to be impervious to personal or genre- 
based variation. The reasons for the stability of this value are at present unclear. One 
possibility is that languages have a tendency to evolve towards a cohesive equilibrium, 
in which texts are as cohesive as necessary to make them coherent but not more so, 
an equivalent on a textual level of Haiman's (1983) competition between iconicity and 
expressiveness. 


4 Conclusion 


This leads us to an unexpected conclusion. Despite the evident grammatical differences 
between oral and biblical narratives in the sample, caused by dialect and genre differ- 
entiation, the conceptual properties of their underlying referential cohesive structures 
are surprisingly similar: against initial expectation, no major systematic differences can 
be observed in the phoricity or functional type of cohesive relationships. Even more so, 
the data suggests that, in defiance of lexical and grammatical variation in the two genres 
and dialects, the cohesive density of the two genres under investigation is invariant. This 
may point towards a cohesive constant underlying the structure of Bunun texts, though 
further research will need to verify this. 

There are a small number of systematic differences between the two text genres. In 
terms of the referential type of the concepts they encode, referential cohesive ties in 
oral narrative tend to refer more to spatial or temporal location and those in Biblical 
narrative more to entities. This corresponds to a predilection for place and time words 
in the former genre, and for nouns in the latter. I argued above that this distinction is in 
alllikelihood due to genre-based informational demands. On the other hand, a contrast in 
the frequency of demonstrative and personal deixis is probably rooted in dialect-related 
grammatical differences. 

The present study is intended as a pilot, a fact-finding mission. Despite its modest 
data set, it has come up with interesting and unexpected results, but future research 
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is necessary to test whether the present results will stand when tested against larger, 
statistically valid and more diversified data sets, and to find out whether regularities 
can be found in any of the lower-level categories. A number of questions regarding the 
invariance in cohesive density need to be answered. Will the cohesive density constant 
hold up in a larger sample with more genre distinctions and dialects? If so, how can it 
be explained? And does a similar phenomenon exist in other languages? 
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Abbreviations 

? function unknown GNR generic 

1E first person exclusive HESIT hesitation marker 

1S first person singular HUM human 

2S second person singular INCH inchoative 

3P third person plural INTENS intensifier 

38 third person singular INTER interjection 

ABL ablative prefix expressing IRR irrealis 
movement from LNK linker 

ALL allative prefix expression LOC locational prefix expressing 
movement toward position in or at 

ANAPH anaphoric marker LV locative voice 

ART article MED medial 

ASSOC associative N noun 

ATTR attributive marker NEG negator 

AV actor voice NSUBJ non-subject form 

CAUS causative NUM numeral 

COMPL complementizer NVIS non-visible 

COORD coordinator OBL oblique case marker 

cv CV reduplication P plural 

D1 determiner paradigm 1 PAUC paucal 

D2 determiner paradigm 2 PLACE place word 

DEFIN definitional marker POSS possessive 

DEM demonstrative PROG progressive 

DIST distal PRON pronoun 

DYN dynamic PROX proximal 

EMO emotive PRT particle 

ENUM enumerator PRV perfective 
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9 
RECIP 


RESOBJ 
S 

STAT 
SUBJ 
SUBORD 


1 Referential cohesion in Bunun: A comparison of two genres 


question word TIME time word 
reciprocal TPC topicalizer 
resultative object USPEC underspecified 
singular UV undergoer voice 
stative V verb 

subject form VIS visual 
subordinator 


Appendix 


T1: Segment oral narrative 


Source: Takivatan Bunun Corpus 

Corpus location: TVN-008-002:130-134 

Speaker: Vau Taisnunan, M, 75 y 

Location and time: Bahuan (Mayuan), 2006 

The excerpt below was previously published as example 22 in De Busser (2017). 


Text 


(8) Aupa tuða nian tu nanu sanavan minsumina Tian, minaba?av tupa naip tu: 


a. aupa tuda ni-an tu nanu sanavan min-suma-in-a Tian 


thus real NEG-PROG COMPL really evening INCH-return-PRV-LNK T. 


mina-ba?av tupa naip tu 

ABL-high location say DEM.S.NVIS COMPL 

“But, when it wasn't really evening yet, Tiang had returned, he had come 
back from the mountain and told us: ..? 


(9) Na, maqtu laqbinina, na?asa dusa ta matiskun, malu?umi han ba?av daióaki, 
pinkaunun isian ba?avta, nabul. 


a. na maqtu laqbinin-a ^ na-asa dusa-ta | ma-tiskun 


well be possible tomorrow-LNK IRR-have.to two-T.DIST DYN-in.a.group 


malufum-i han bafav daióa-ki 
disperse-prT be at high.location there-K.PROX 


c. pinkaun-un i-sia-an ba?av-ta gabul 


go.up-UV — LOC-ANAPH-LV high.location-T.DpisT deer 

"Well, tomorrow is possible, two of us will have to go together, and disperse 
when we get to this place, and we will climb upwards to the deer that is in 
that place above? 


(10) A, namaqaisaq dauka, saqnutai du sia ?ukai laqaiban. 


a.a na-ma-qaisaq dau-ka 


INTER IRR-DYN-in.that.direction EMO-K.DIST 
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b. saqnut-ai-du sia fuka-i laqaiban 
q 
get.stuck-PRT-EMO ANAPH NEG.have-PRT route 


‘A, if he will go in that direction, he will get stuck there, without a way out. 


(11) Mei, mei kahaul duna ?uka duduma laqaiban, aupa tuða, maupa tupina. 


a. mei mei ka-haul dun-a 
already already come.from.below line-LNK 


b. ?uka du-duma laqaiban 
NEG.exist RED-other route 


c. aupa tuóa maupa tupa-in-a 
thus real thus ` say-PRV-LNK 


"Ihe track is coming from below, and there is no other way out, it really is 
like that, thus he told us? 


(12) Ansaisaga Atul Dain tu “nis, matinmutin tamudana madav.” 


a. ansais-an-a Atul dain tu 
forbid-PRoc-ENUM A. large COMPL 


b. ni-is ma-tinmut-in ta-mu-dan-a | maó?av 
NEG-3$.F STAT-morning-PRV ?-ALL-road-LNK embarrassed 


“But Big Atul forbade us: “no, when it has become morning, we will leave, it 
is embarrassing: 


(13) Na, ?ukin aipa fita namudanin, musbai naipa maqmut. 
a. na ?uka-in aipa fita na-mu-dan-in 
well NEG.have-PRV DEM.S.DIST.VIS there.DIST IRR-ALL-gO-PRV 


b. mu-isbai naipa maqmut 
ALL-cause.to.move DEM.S.DIST.NVIS night.time 


"Well, it will not be there anymore, it will be gone, it will have run away 


during the night: 


Cohesion analysis 


Table 10 contains an analysis of referential cohesive elements in text T1 above. Numbers 
in the headers refer to the data points referred to in 82.2. 
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urÁuoləd 3uoudeuy SUIT], /90T Ap jo eum Sururour, [ul-Jinwiny[-pu] op AWIL um ystu, ;mubpu — qo 
Kiruop ououdeuy A0? loop SIALsITS'Wa3qd,Ddip eg Waq SIAN'ISIq'SWaq Ddipu q9 
urkuojuy ououdeuy SUL /IOT 1əəp uoneoo[ Mojeq, [ney[-ey] ep ed ISITI], DI — v9 
Kruop ououdeuy A0? loop Xst,s[m] | qe Waq Sta Ltd gä, Dn eg 
urKkuodÁH ouoydeuy SUIT], /90T Ap jo eum XOUd LS I1AV, 1y-[DeIDp] | qz AWIL ,Suruzour, [ui-Hmuün[-pu] qs 
Á1quopI »uoudeuy Ride loop udeug pis qç Noud AXSst,s[m] qc 
Dpuapnt ` ouoqdeuy Am Tuv [9zt:z00-g00] (mv 31g attwisra oy, (app jnzy-svu) N mv sig, np mit ee 
Áruop ouougdeuy sump/2o0T Jaap 21001 Qui unp ep N or, ueqrebe qp 
K3ruop ouougdeuy sump/2o0T Jaap 21001 mor upqipbv| qç N Qu unp ep 
urkuojuy ououdeuy aU] /I0T Jaap uoneoo ASIGLAD LUV, py-[npp-bosipb-pui-pu] ee A Mojeq, neu[-ex] er 
ot0ydoxy sump/2oT 1əəp anor N nor, upqimbv| qç 
Á1quopI »uoudeuy A0? loop Jesp,jnqpü oz HdVNV ,HdVNV,DIS qg 
Kiruop ououdeuy SUL /I0T Jaap uoneoo uoneoo[ YSIY, »j-ApgvQ XZ Luv Jet LAT LAV, py-[npp-bvsib-pw-vu] ee 
Kiruop ouougdeuy Ride loop JAV pn: OZ N I99p,[nqpü oz 
Kiruop ououdeuy Ann? 199p uomneoo ,HdVNV,DIS OZ Lav ASIGING LUV, DJ-[Angoq] | oc 
Kiruop ououdeuy 9ump/2o] Jaap uoneoo Quo yey] jo eJd ayy, up-p:s-1 — oc ed ,uoneoo[ qgrq, Io-Jnmmg oz 
DG) »uoudeuy Áu loop [Sz1:200-800] Lead 1007, vuvdop) HdVNV ,HdvNv, [uv-]vis[-1] as 
Áiruop ououdeuy SUL /I0T Jaap uoneoo aoe yey, Domp — qz HdVNV Quo jeu jo e»e[d ay}, uv-vIs-1 — oz 
Kiruop ououdeuy sump/2oT Ap jo eum ,Moi1louroj uiüiqbo] eg Lav XOWalLAslwv,ny-[vgmp] — qc 
Kiruopr ouougdeuy jusAq = Jaap uoneoo ,Uuoneoo[uSm avgoq qz ed aoe yey, [14-]vgip — qz 
Kruop ououdeuy SUL /I0T Jaap uoneoo SITINDLUV, D]-[psnp] e A ,UoHneoo[ US, apzog qz 
patt ououdeuy SUL /I0T Jaap uoneoo ,urejunour əy} uro1j euroo, anzpq[-punu] ot Luv SI@LNGLUV, Dj-[vsnp] es 
mod 3rroudoxq Ride E) (1x91 "Ao1d) WON OM], Di-Denp 2z 
urÁuoləxd ououdeuy SUIT, /90T Ap jo eum SumuaAs, upapups VJ AWIL Morowo}, [r-Jutüqbv] ez 
Áquopr »uoudeuy A0? IL Lat et waa SIAN'S'WIA, dipu qI 
Áy4uəp] ouougdeuy sump/2oT Jaap uoneoo (1x91 aard) A  ,urejunoui əy} ou euroo, anzpq[-punu] or 
Áquopr »uoudeuy e Det. (1x93 `Aə1d) N LL, (DT et 
3rroudoxq ƏurL/2o1 Ap Jo eum SWIL ,SuruaAs, upanuns VI 

()vr?u (9)uonoun (S) eL opor (c) ssep 
EH 9uoqq  [euonoung (p) WIA (e1) SIL | (c) prom 324 (o9u)e»uergew (I) 


LL SrsA[eue uorsaqoo EYUA 01 AQEL 
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T2: Segment oral narrative 


Source: Takivatan Bunun Corpus 

Corpus location: TVN-012-001:38-41 
Speaker: Tulbus Manququ, M, 75 y 
Location and time: Bahuan (Mayuan), 2006 


Text 


(14) Maqai maqabasi tupa tu madain?adi namuqumaka tanusan matibahi. 
a. maqai ma-qabas-i tupa tu ma-dain fad-i 
if DYN-in.former.times-PRT say COMPL STAT-old-PRT 
na-mu-quma-ka 
IRR-ALL-field-K.DIST 
b. tanus-an mati-bahi 
first PROG-have.prophetic.dream 


‘If in the old days the elders said they wanted to work on the land, they 
interpreted a prophetic dream beforehand’ 


(15) Namaqun ?ita maqai masihala bahia, tudip, na, sintupadu tu maqai ?itun asa na- 
masihal kakaunun. 
a. na-maqun fita 
IRR-cut.off there.DIST 


b. maqai ma-sihal-a bahi-a tudip 

if STAT-good-suBORD prophetic.dream-suBonp that.time 
c. na sin-tupa-du tu maqai itun 

well RES.OBJ-say-EMO COMPL if there.MED 
d. asa na-ma-sihal ka-kaun-un 


be.able IRR-STAT-good cv-eat-UV 


‘And when they wanted to go there to harvest (lit: when they wanted to cut 
off things in that place), if the dream was good, that meant in those days that 
if you were there, you could eat very well’ 


(16) A maqai dipi madiqla bahia tupa tu asa ni ?ituni nalauq, nitu na ... masihala 
kakauna sanasia maqai, amin tu maqai Zitun namuqóa kuóaki madiqla bahi, na 
haida mataó. 

a. a maqai dip-i ma-diqla bahi-a 
INTER if then-PRT STAT-bad prophetic.dream-LNK 
b. tupa tu asa ni fitun-i 
say COMPL have.to NEG there. MED-PRT 
c. nalauq ni tu na ma-sihal-a ka-kaun-a 
otherwise NEG COMPL well srAT-good-LNK cv-eat-LNK 


28 


(17) 


1 Referential cohesion in Bunun: A comparison of two genres 


d. sana-sia maqai 
ACCORDING.TO-ANAPH if 
e. amin tu maqai fitun na-muqóa kuda-ki 
all cowrrif there.MED IRR-again work-K.PROX 


f. ma-diqla bahi 
sTAT-bad prophetic.dream 


g. na haióa mataó 
wellhave die 


‘And if the dream was bad, then they said that you must not go there, 
because otherwise you would not eat well, if you followed the rule, but if 
anyone at all went back to that place to work, and there was a bad dream, 
people would die’ 


A, maqai mataisaq ... matataisaq a madadain?aó tu, ... maqai mun?ita?a mavia 
mataisaq tu sadu?uki siatu, sinsusuaó bunuaó masmamua mavisqai, mavilasa tu- 
paka madadairaó tu na maqtu munquma ista?ai nakasihalain kakaunun namasi- 
hala bunun. 


a.d maqai ma-taisaq 
INTER if DYN-dream 
b. ma-ta-taisaq a madadain?ad tu 
DYN-CV-dream INTER elder COMPL 
c. maqai mun-?ita a ma-via ma-taisaq tu 
if ALL-there.DIST HESIT DYN-why DYN-dream coMPL 


d. sadu?u-ki sia tu 
See-K.PROX ANAPH COMPL 


e. sin-su-suaó bunuaó mas-ma-muav | ma-visqa-i 
RES.OBJ-CV-sow plum — BE-cV-excessive srAT-abundant.with.fruit-PRT 


f. mavi-las-a 
CONTAIN-fruits-LNK 


g. tupa-ka madadain?ad tu 
tell-K.pisT elder COMPL 


h. na maqtu mun-quma ista-ai 
well be possible to ALL-field ^ 3s.DIST-PRT 

i. na-ka-sihal-in ka-kaun-un 
IRR-ASSOC.DYN-good-PRV CV-eat-UN 

j. na-ma-sihal-a bunun 
IRR-STAT-good-LNK people 
‘And if they dreamt... if the elders dreamt that, if they went over there, they 
suddenly dreamt that they saw that the plum tree had grown so that it was 
full of fruits and had large fruits, then the elders would say that it was 
permitted for them to the land to work, and they would produce good fruits, 
and the people would also be fine: 
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uduosad APT ououdeuy mm? S1op[o siat, fiv-Jojst1 uy N Əldoəd ununq fp 
AanuəpI ououdeuy mm? S1op[o ,S4apja, gogüivpoppu — By NOUd Lsirst, [m-]vi1 ur 
Ayquepy ououdeuy oum/20T pue  Ə1əun 1940 (08), pu [-unu] op A PIY eu o1 o8, pwnb[-unu] | ur 
Ayquepy ououdeuy mm? S1op[o ,HdVNV,DIS pp N ,S4ap]a, gogüivpoppu — By 
yed ououdeuy 9um[/2oT Soul I9ULI0J UI usw, [1-Jdip ee Lav am yey} ye (Aes) px-[pdnj] 3p 
uAuorad APT ououdeuy mm? 921) um[d mid. gpnunq — ap A ,Simag (Jo mt əq), [v-]soj[-iamu] — Jy 
onoudoxq mm? 921) um[d N umjd,gpnunq ər 
AanuəpI ououdeuy mm? S1op[o ,SI9ppe, Qpgüivpoppu qr HdVNV ,HdVNV,DIS py 
urKuojojq ououdeuy uaAq weap weap ogeudoud, nq — je luv Aa (ses) my-[ngnpvs] pr 
Ayquepy ououdeuy oum/2oT pueg Om aa), unity ` og A 19g 1940 (o3) vyg[-unu] ə 
Ayquepy ououdeuy mm? S1o9p[o Sopp, [1-]gpg-ümp-pwu er N ,SI9p[o, gnzüivpoppu qr 
matt ououdeuy Daa ureo1p weap oneudoad, [p-]tjpq eg N weap onəudoid npq Jg 
Ayquepy ououdeuy oum/2oT pueg Om aa), unity — ae luv aoeyd sm ur (304 ayy), »y-[pony] ae 
Áiquopr ououdeuy oum/2oT puer ,dqawereq,[r]ungj oe Eër? Qaw'ənul unu əç 
ououdeuy ƏL 1x91 ^  ,peuonueurojge əy} 0} Surp1oooe, pis[-pups] pe 
Áiquopr ououdeuy oum/20T pueg ¿LSIQ'ƏI9U1 Du VZ 14 IWY, [r]ung oe 
urKkuodÁH ououdeuy Daa ureo1p weap orjeudoad, [v-fiyoq ` qz N (weap oreudoad, [p-]ypq — ec 
urÁuolə]w ououdeuy 9um[/2OT Soul rəuuojur SW yey}, dipny qz AWIL uew,[r-]dip ee 
ououdoxq oum/2oT att ERAI TIWI}, unu o 
uráuodÁH ououdeuy 9um[/2oT Soul I9ULIOJ UI ,SÁep po am ur, 1-[spqvb-pw] er AWIL Qum PY, dpn} qz 
uikuodAH ououdeuy Daa ureo1p ,ureo1p oyeydoid e aaey, mupdl Am! — ot N weap orjoudoad, [v-Jryoq — qz 
Kiop ouougdeuy oum/2oT pure, pueg ay} uo ou 08 [py-]rwnb[-unu-vu] er aOW1d <LSIG'9I9Y}, DZ VZ 
onoudoxq uaAq ureə:p A ,ureo1p syeydoid e aaey, mupdl Am! qi 
ououdoxq oum /2oT puer Gogo luv Ə1əu1 1940, py-[pwunb-unwu-vu] e 
ououdoxq oum/2oT purl A (Pury 201 uo oa 08 [py-Jownb[-unwu-vu] er 
ououdoxq mm? S1op[o N  S1əp[ə, [1-]gpg-üivp-owu et 
ououdoxq SUL /IOT Soul :əuuojur A ,SÁep pjo am ur, b-Jepanb-ou er 

()*ro?w (9)uonoung (S) eLə (z) sse 
Tu »Hogd  [euonounjg (p) 1uo19joy (BL) Bre (e) prom (w) vuy (D 
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T3: Segment Biblical narrative 


Source: Tama Dihanin tu Halinga. The Bunun Bible in Today's Taiwan Bunun Version (Bible 
Society in Taiwan 2000) 
Corpus location: Acts 3:1-10 


Text 


(18) 


(19) 


Aida tu hanian, masa taunhuvalin tu dintau, Pitilu mas Iuhani hai kusia Masaninsin 
Pisvanduan. 


a. 


faióa tu hanian 
exist COMPL day 


masa taughuvali-in tu din-ta 
WHEN noon-PRV ATTR HOUR-three 


Pitilu mas Iuhani hai kusia masaninsiyn pisvagduan 
Peter OBL John ` pc use holy temple 


“Ihere was a day, when it was three at noon, that Peter and John were using 
the Holy Temple’ 


Isia tupaun tu Manauad Ilav dia hai, aida tu tadini maisna taus?uvaóun mapiha, 
kaupa hanian ansahanun mas bunun mapunsia ilav dia, makikisaiv mas 
nakunadah sia Masaninsin Pisvanduan tu bunun. 


a. 


i-sia tupa-un tu manau?ad Zilav-d;ia hai 

POSS-ANAPH Say-UV COMPL beautiful door-DIST.OBL TPC 

faióa tu tadini maisna taus?uvaó-un ma-piha 

exist COMPL one.HUM from ` give.birth-uv srAT-cripple 

kaupa hanian ansahan-un mas bunun ma-pun-sia tilav-dia 
each day carry.to-UV OBL person DYN-CAUS.ALL-ANAPH door-DIST.OBL 
ma-ki-kisaiv mas na-ku-gadah sia masanigsir pisvagduan 
DYN-RED-give OBL IRR-ASSOC.ALL-lower ANAPH holy temple 


tu bunun 
ATTR person 


‘At what was called the Beautiful Gate, there was one man who was cripple 
from birth, and people carried him every day and put them at that door, and 
he begged to people that went down into the Holy Temple: 


(20) Sadu saia tu Pitilu mas Iuhani hai nakunadah sia Masaninsin Pisvanduan, at maki- 
saiv naidia. 


a. 


sadusaia tu 
See 3S.NOM COMPL 
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b. Pitilu mas Iuhani hai na-ku-gadah sia masaninsin 
Peter OBL John TPC IRR-ASSOC.ALL-lower ANAPH holy 
pisvagduan 
temple 


c. at makisaiv naidia 
and make.give 3P.OBJ 


“He saw that Peter and John were about to enter the Holy Temple and made 
them give (money) [tried to ask them for money]: 


(21) Naia hai samantuk said;ia tupa Pitilu tu: “Sadua kasu madami!” 
a. naia hai samantuk saidia 
3P.NOM TPC keep.close.watch.on 3s.oBr 
b. tupa Pitilu tu 
say Peter COMPL 
c. sadu-a kasu | ma-óami 
See-LNK 2S.NOM DYN-1E.OBL 


p» 


"Ihey looked straight at him, and Peter said as follows: "You look at us 


(22) Saia hai samantuk naidia, asa usióan maómaó. 
a. saia hai samantuk naid;ia 
3S.NOM TPC keep.close.watch.on 3P.0BL 
b. asa u-sida-an maómaó 
want ABLE.TO-take-Lv which.things 


“He looked straigth at them, he wanted to be able to get something from 
them: 


(23) Pitilu hai tupa saidgia tu: “Ukan saikin kim mas sui, haitu nasaivan ku kasu mas 
inak tu iskakaupa: Mapakasia saikin mas itu takisia Nadale tu lesu Kilistu tu nan 
tupa masu tu, mindankada mudan!” 

a. Pitilu hai tupa saidia tu 
Peter TPC say 3S.OBL COMPL 

b. ?uka-an saikin kim mas sui 
NEG.have-Lv 1s.TOP.AG gold COORD money 

c. haitu ` na-saiv-an-ku kasu masi-nak tu | iskakaupa 
although 1rR-give-LV-1S.NSUBJ 25.NOM OBL POSS-1S.N ATTR everything 

d. ma-paka-sia saikin mas itu taki-sia Naðale tu 
DYN-RECIP-ANAPH 1S.TOP.AG OBL this.here ORIGIN-ANAPH Nazareth ATTR 
lesu Kilistutu yan 
Jezus Christ ATTR name 


e. tupa masu tu 
Say 2S.0BL COMPL 
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mindankad-a mu-dan 

stand.up-LNK ALL-go 

"Peter told him: “I do not have gold or money here, but I will give you 
everything I have here. I use the name of Jesus Christ who comes from 
Nazareth to tell you: stand up and walk" 


(24) Pitilu hai ma?alak mas isaidia tu tanaskaun ima, sidankaó said;ia. 


(25) 


(27) 


a. 


Pitilu hai ma-?alak mas isaidjia tu ` tanaskaun ?ima 
Peter TPC DYN-lead OBL 3S.POSS ATTR right hand 
si-dankaó saidia 

?-stand — 3s.oBL 

‘Peter led him by the right hand, and helped him to stand’ 


Itu bunun dia bantas mas buhtur hai &insu istamasaóan, at mataidaóa, matudul- 


dul, kityab mudadan. 


a. 


itu bunun-dia bantas mas buhtuy hai insu 

this.here people-pisT.oBr leg.and.foot opt, joint TPc immediately 
is-tamasaó-an 

TRANSFER-strength-Lv 

at mataidaóa matuduldul kitnab mu-da-dan 

and jump stand begin ALL-RED-road 

"Ihis man here, his legs and joints immediately became powerful, and he 
jumped up and stood, and he began to walk 


Saia hai taskun naid;ia kugadah sia Masaninsin Pisvanduan, madsishahainaó mu- 
dadan, matumashir mas Sasbinaó Dihanin. 


a. 


saia hai taskun | naid;ia ku-gadah sia 

3S.NOM TPC do.together 3P.NSUBJ ASSOC.ALL-lower ANAPH holy 
masaniysin pisvagduan 

temple 

mad;ishahainaó mu-da-dan | matumashir) mas Sasbinaó Dihanin 
gleeful ALL-RED-road thank OBL God 

"Together with them he entered the Holy Temple, and gleefully walk over 
and he thanked God: 


Bunun hai sadu saidjiaa tu mudadan, at matumashir mas Sasbinaó Dihanin, at 
&inhuda, au pa sahal naia tu saia hai takisia Masaninsin Pisvanduan tu Manauad 
Tou malfanuhu makisasaiv tu bunun. 


a. 


bunun hai sadu saidjia-a tu mu-da-dan 
people TPC see 3S.OBL-LNK COMPL ALL-RED-road 


at matumashir mas Sasbinaó Dihanin 
and thank OBL God 
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C. 


at djinhuóa 
and be.startled 


aupa sahal naia tu 
because clearly 3P.NOM COMPL 


saia hai taki-sia masanipsir pisvanduan tu 

3S.NOM TPC have.origins.in-ANAPH holy temple ATTR 
manau?aó Zilav malfanuhu sa-makisaiv tu bunun 

beautiful door sitdown  srE-beg ATTR person 

‘People saw him walk, and thank God, and they were startled, because they 


recognized him as the man that used to beg sitting down at the Beautiful 
Door that was the entrance to the Holy Temple: 


A comparison of two genres 


1 Referential cohesion in Bunun 


Kiop] »uoudeuy us uyot mot, mmunt — or N mot mupynt qç 
Dmmapt ` "ode pm? 13d Je d,npid N N Jed, nud qç 
Ájquop] »uoudeuy Am? ə[ddrro ƏIddio-rvrs piidpu qz NONd NON'SE, DIDS eg 
urKuodÁqo? ououdeuy Ayqug əjdoəd ,uos1d unung oz N ,uosied, ununq pz 
Áiquop] »uoudeuy Ayqug Ə[durə1 ,HdVNV-SSOd,UIS-] Eé N  Əldurər, ÁJOH oui, upnp(ipasid Wrstuunpen pz 
Ájuop] »uoudeuy IXƏL ə[durə1 durer Apog eui, upnpü(pasid (1s(tupspuu oT HdVNV ,HdVNV,DIS pc 
urÁuojoj i ououdeuy eum/20T  e[dure uoreoo A180'LsIq-100p, [v1P-Javjiy ae EëVet? ,ITe^o[-rIvoossv-mi, yopvh-ny-vu — pc 
AanuəpI »uoqgdeuy eum/20T  e[durej uoreoo pn gogynvuvu] e Liv THO LsIa-1oop, pifp-[avpg] ae 
urÁuojo]q »uoudeuy Ayqug Joop ,HdvNv-TIv'snvo-Naq, pis[-und-pwu] as N "180 LsIa-100p,[vifp-]avqjy as 
urÁuojoj ououdeuy Ayqug Joop (Igo'rsItq-roop [ngyrneoq, [v1 S-Janjiy ommnnunu es A ,HdVNv-TIV'snvo-Nad, pis[-und-pu] ae 
ououdoxg ÁAnug əjdoəd N uosIəd, ununq o 
»uoudoxg eum/2o7 Aep Araaa HWIL ¿ep Vum oz 
AauəpI ouoydeuy Ayqug ə[ddrro Ant ao. Tun) qz N td us, pyidvu qz 
»uoudoxg Ayqug ə[ddrro WAN nn ao, um qz 
urÁuo19lw »ououdeuy eur/20T  e[durej uoreoo[ ,HdVNV-SSOd,DIS-] — vg INV ` /IHOLSIq-100p [ngrineoq, vifp-[av[ggy evjnvuvu] e 
yed »uoudeuy Ayqug Joop ,HdVNV-SSOd,DIS-] Ee N  J18OLSIG-100p [ngyineoq, [piSp-Javjgy goznouvw e 
Áiquop] »uoudeuy Am? Eu a[duray, ÁToH oui, ubnpüpasid (isürupspwui oT HdVNV .HdVNV-SSOd, DIS-1 Eé 
ououdoxg Ayqug Əldurə1 N  Əldurər, ÁJOH ey}, uvnphvasıd shjuvsow ` ot 
»uoudoxg Ayqug uuof N uuof,rupunp — or 
»uoqdoxg mu? BPA N panud ` at 
yed ououdeuy OUT], /907 Aep jo eum Aep,upiuvy — et WON 209,0 sem, DM qı 
wAuojayy ` oto ydeyea UIT, /907 Aep jo eum opop,oesenp,rrufp qr ^ ,uoou əutooəq Suraeu, [ur-]ippanyünp: ` qr 
»uoudoxg aui /20 Á10js jo eum N App. mum “x 

Letz (9)uonounjg (S) eL opor (c) sse[ə 
TPA 32uoud  [euonoung (p) PIJN (eL) jə9Sre[ (e) PioA 24 (9q)s»ue1sow (D 


E.L stsÁ[eue uorsaqoo [enuə:zəJjəl] :ZI AGEL 
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Áiuop] ouougdeuy te o[dduo ,SSOd's£, DISIDSI eL NOMNd 190 st IIS qL 
ped ouougdeuy Am — o[dduo jo puey SSO4 SE, DISIDSI eL N pueu, pu — e 
Áiquop] ouougdeuy te o[dduo ,I8O'SZ, NSW 29 NOUd SSOd'S€ DIpws1 — vj, 
Áiquop] ouougdeuy te IER) NON St, uly1Ds po N 194, nd ` EL 
A0uapt onuoudeuy e o[dduo ONSE. NSDY 29 NOUd ,IHO'Sz, nspa — 99 
onoudoxq te ouru N auu, uv po 
ououdoxq e snsof N ASHYO snse[,njsiy nsap po 
Áiuop] ououdoxq oum/20T UjerezeN TEZEN, 2ID00DN po N (YyoIeZzeN, 2jpQoN po 
ÁAjnuep] | ououdejej aU /20T yyorezeN A ,HdVNV-NIOTHO, DiS[-bpb1] — po 
Áiuop] ouougdeuy te 1d N'ST-SSOd, YDU-1 39 NOUd JWON'SL, une po 
ououdeuy juoAq op [9»ueques ə-rquə] ` 29-qo ^ ,HdVNV-dIOdu-NAG, pis[-pypd-pnu] p9 
IOM ououdeuy Ájquq suorssossod ,Áouour, ms qo N SunpÁroAs, pdnp3pysi — 29 
Áuop] ouougdeuy te 13d N'ST-, ny-[uv-mps-vu] 29 Noud ,N'SI-SSOd, YDU-1 39 
Áiquop] ououdeuy te o[dduo ,18O'SE, ps ?9 NOUd JON SE. Hen — 29 
Áiquop] ououdeuy ze IER) NON St, uly1Ds qo NONd Met: ny-[uv-aips-vu] 39 
ped ouougdeuy e suorssossod Sim uon, gptugptu qc N Asuow, ms q9 
ped ououdeuy te suorssossod Sum qon, gptugptu qc N p103, uay — qo 
Áiquop] ouougdeuy te 1d Ad, nd ?9 NOUd JWON'SL, une qo 
Áiuop onuougdeuy te o[ddio WON SE. DIDS BG NOUd Igo's€ DIPS eg 
Áiquop] ououdeuy te IER) ,J939d, NING qr N 19494, HIM v9 
onoudoxq e suorssəssod 6 SSunpruomqA, gpuigpu qg 
Áiquop] ououdeuy Am uyof pue :ələq ,/18O'8I-NAG, rumo[-pui] op Noud Joao At mun ge 
Áiquop] ououdeuy te ə[dduo ONSE. Hen op NOUd JON SE DIDS 8G 
A0uapt ouougdeuy Am uyof pue :rələq WON dE. pipu ep NOMNd ,I8O'dI-NAG, nupg[-pw] ap 
Áiquop] ououdeuy te ə[dduo ,18O'SE, pifpins ep NOUd JON SE. nsDy — Oy 
Áiuop] ouougdeuy e IER) PPA, nd qc N 194, nd qv 
Áiuop] ouougdeuy e ə[dduo WON SE. DIDS eg NOUd 190 st pipis ep 
Áiquop] ououdeuy mug uyof pue 134 Joo At. pifpipu EI NOUd JNON'd£, pivu ep 
IOYM onuougdeuy Am | uyof pue :ələq ,uuo[, 1upqnp + 19494, NING qc NOWUd Joo At pipipu oç 
Áiuop] ouougdeuy bieten əjdwə} ` [dur] Aog oui, upnp(ipasid ürsürupspvui pc N due ÁH eu, ubnpüvasid üisürupspu qg 
Áiquop] ououdeuy Lat. əjdwə} ` atdoat. Aog oui, upnp(ipasid ürsürupspvui pc HdVNV ,HdVNV,DIS qç 
Á1uop] ouougdeuy eum[/20]  e[duroj uono] ,I9^0[-T1V'2OSSV-niII, t/ppp-n»-pu pc ^ ,I9^O[-T1V'2OSsv-uI, jDppü-ny-pu qg 
(L) e24  (9)uonounj (S) eL efor (z) sse» 
Tu »uogd  [euono»ung (s) 1021994 (eL) PIEL (£) priom oy (4) agaang (D 
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ÁiuopI »uoudeuy ier ə[dduo AONSS DIDS at N ,uosied, unung ` ap 
Ayuep] Suroudeuy ier 1oop ,I9AO[-T1v'oOSsV-wur, jDppü-ny-bu X pc N loop [njneoq, ADM Qu;npupu — 90 
Ayuep] ou0ydeuy Ájnuq e[duo ada, rot eui, upnp(pasid Distiunepu ?6 N [dung AjoH oui, upnp(ipasid Disiupenpu ` ap 
Ayquepy — ououdeje? Aug e[du .əldurərL rot oui, upnp(ipasid ü:sürupsbuu ` oot A ,HdVNV-NIOINO, Dis[-13D1] ` 90 
Ayuep] ououdeuy Aug ə[dduo QINT-1a0'S€, [p-Jpi#pips ` eot NOud WON SE DIDS ap 
Auapt onuoudeuy ier o[dood ,uosied, ununq — eor NOUd WON AE. piou pO 
mup; onuoudeuy Ann poo POD, utupuiq gouiqsbS ` de N POD, uruvyIq gpuiqsps — qo 
Ayuep] ououdeuy ier ə[ddiuo INON'SE, DIDS ?6 NONd QINT-1a0'S€, [v-Jo1pivs ` eg 
modo ououdeuy ier o[dood ,uosied, unung pz N ,uosied, unung ^ ep 
onoudoxq ier poo N poo, ulunyig gpuiqsvg ` q6 
Ayuep] onuoudeuy Ájnuq e[duo .əlduər rot eui, upnp(ipasid üisürupspui qc N [dung oH oui, uvnphvasıd üisüupspvu ?6 
Ayup] Suoudeuy at aidat ada, rot oui, upnp(ipasid üisürupspui OC HdVNV ,HdVNV, Die ?6 
tuÁuo19Ww ououdeuy eump/o0]  e[duroj uoneoo[ 180 1s1a-o[dood, [vifp-]Jununq eg 3OVTa Əmo[-TIV'2OSSV, ipppü[-ny] ?6 
Áiuop »uoudeuy num | mof pue pd oo At. vifprpu RG NONd Joo At. pifpipu ?6 
Ayup] Suroudeuy ier ə[ddiuo 180 1s1a-o[dood, [v1s-Jununqg eg NOMNd WON SE DIDS ?6 
yredog Suroudeuy ier quim Sat, spjupq eg N auol, Gnjyng eg 
ped ououdeuy ie quim 180 151a-o[dood, [pifp-]Jununq eg N Sat, sp1upnq eg 
tuÁuo19Ww ououdeuy eump/o0]  e[duroj uoneoo[ ,I90[-T1V'DOSSV-WuI, tDpp(-ny-pu qc Liv  1go`rsiq-əldoəd, vifp-[ununq] eg 
Áy4uəpI ououdeuy Aug ə[dduo  1go:s€, DISPIDS qL N ,180'1s1a-o[dood, [pifp-]ununq eg 
()*r?W (9) uompuny (S) €L atoi (z) ssepo 
Tu onogd [euonoung (f) mgauatag (e Sre (e) proa oy Ey) uzy (1) 
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This chapter describes the expression of referents in Singaporean Malay using a parallel cor- 
pus of elicited narratives. We demonstrate that the speaker's epistemic stance affects how 
discourse is constructed. The speaker's epistemic stance is apparent in referent categorisa- 
tion: referents can be categorised either as "familiar", when taking a strong epistemic stance, 
or as "unfamiliar", when taking a neutral stance. We show that referent categorisation is 
more fundamental than the information structure notions of new, old, or given. Familiar 
human or animate referents are expressed with proper names and are pivotal for organ- 
ising the narrative plot: by constructing other discourse-persistent referents in relation to 
the familiar referent, their description and tracking simplifies. Human and animate refer- 
ents categorised as unfamiliar are expressed with nominals. Their descriptions and tracking 
are more elaborate, involving demonstratives and discourse particles, whose function lies 
in the coordination of joint attention. Inanimate referents are rarely subject of strong epis- 
temic stance and are therefore expressed with nominals. Their discourse-persistence is the 
best predictor of how elaborate their description and tracking are. 


1 Introduction 


One of the fundamental functions of human language is balancing the information dis- 
parity between the speaker and the hearer. It has been argued that the speaker and hearer 
both operate under the assumption that the world presents itself in the same way to their 
interlocutor. Under such an assumption, the speaker can "trade places" with the hearer, 
and can predict and mitigate obvious disparities (Rommetveit 1976; Zlatev 2008; Duranti 
2009; 2010). 

Frantisek Kratochvíl, Nur Izdihar Binte Ismail & Diyana Hamzah. 2018. Stance, cate- 
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The above information-disparity problem is examined through the study of informa- 
tion structure, i.e. the structural arrangement of various types of information, such as 
new, old, given, topic, focus, etc. (cf. Prince 1981; Gundel et al. 1993; Lambrecht 1994; Gun- 
del & Fretheim 2004). 

In a broader perspective, however, the complexity of expression of the new, old and 
given reflects the speaker's stance towards the utterance, reality, and the hearer. This 
stance, or alignment is manifested in the amount of information disclosed in order to 
mitigate disparity (cf. Du Bois 2007).! 

Du Bois' framework conceptualises stance as the process of evaluation and positioning 
towards the object of stance and the mutual alignment between subjects emerging from 
the interaction (Du Bois 2007: 171). Stance is achieved through overt communicative 
means towards any salient dimension of the sociocultural field (Du Bois 2007: 163). This 
process is visualised in Du Bois' original stance triangle, reproduced here in Figure 1. 


Subject 1 
ey. 
hae 
v A s A 
Os 
A On, 
2 š 
S9 Object 
E 
v 
sem 
e? quo 
A 
Subject 2 


Figure 1: Du Bois' stance triangle (Du Bois 2007: 161) 


We demonstrate that the compositional vectors of stance, namely evaluation, position- 
ing, and alignment can be applied to the study of information structure and referent 
expression. We expand the understanding of Du Bois' evaluation to include the choice 
in identifying a referent and referent categorisation, a term borrowed from Stivers et al. 
(2007). The categorisation positions the speakers towards the object differently in terms 
of their epistemic stance. The choice has consequences for the construction of subse- 
quent discourse, as will be documented in 83. 


2 Methodology, participants, and language situation 


The data for this paper consists of a set of elicited narratives in Singapore Malay. These 
narratives were collected using four stimuli sets: (i) Getting the Story Straight (San Roque 
et al. 2012), (ii) Pear Story (Chafe 1980), (iii) Frog Story (Mayer 1969), and (iv) Jackal 


!In Malay/Indonesian, an important work on this aspect of language is Englebretson (2007), which primarily 
deals with the choice of pronouns and its consequences. 
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and Crow (Carroll et al. 2011). The stories allow us to make a systematic comparison of 
how our subjects categorise a variety of referents (human, animate, inanimate, singu- 
lar, plural, etc.). By comparing how referents are introduced and tracked, we reveal the 
consequences of the categorisation for discourse construction. We rely on the annota- 
tion guidelines of the RefLex Scheme to distinguish various types of referents and their 
expressions (Riester & Baumann 2017). 

In this section, we describe the Malay spoken in Singapore (2.1), our participants (2.2), 
and the stimuli used here (2.3-2.6). The instances where we consulted our Singapore 
Malay Corpus are distinguished with corpus text identifiers.? 


2.4 Malay in Singapore 


The Malay language connects diverse varieties that form the Malayic subgroup of 
Austronesian. In all probability from Southern Sumatra, Malay varieties are now spo- 
ken throughout Indonesia, Brunei, Malaysia, Singapore and southern Thailand (Adelaar 
2004). In Singapore, Malay has always had a special status, given its former role as the ad- 
ministrative language and lingua franca and for its political value in the region (Alsagoff 
2008). 

Apart from the symbolic status of the national language of Singapore, Malay is one 
of the four official languages of Singapore, alongside English, Mandarin Chinese and 
Tamil. Malay is the assigned mother tongue of the ethnic ‘Malays’ in Singapore, a label 
comprising people of Malay, Javanese, Boyanese, and Sundanese descent as well as other 
smaller groups from the peninsula and archipelago, which make up 13.3% of the resident 
population (Kuo & Jernudd 1993; Singapore Department of Statistics 2015).? 

Standard Singapore Malay is the formal written and spoken variety taught in schools 
and used in formal contexts (government and media). It is similar to the standard va- 
riety used in Malaysia, with the addition of certain lexical items relevant to the local 
context. Colloquial Singapore Malay is the informal spoken variety. In the past, a num- 
ber of contact varieties emerged, with distinct syntactic, grammatical and phonological 
features (Sasi Rekha d/o Muthiah 2007). The best studied among them include: (i) Sin- 
gapore Baba Malay, a Malay creole influenced by Hokkien, which is typically spoken 
by the Peranakan population in Singapore (Lee 2014), (ii) Singapore Bazaar Malay, a 
Malay-lexified pidgin influenced by Hokkien which was the traditional lingua franca for 
interethnic communication (prior to the rise of English) and is typically spoken by Sin- 
gaporean Chinese, and (iii) Singapore Indian Malay, a Malay-lexified pidgin influenced 
by Bazaar Malay and Indian languages which is typically spoken by Singaporean Indi- 
ans (Adelaar & Prentice 1996; Daw 2005; Sasi Rekha d/o Muthiah 2007). Rising levels of 
bilingualism with English introduce contact features such as code-switching, borrowing 
of lexicon and structural convergence with Singlish. 


2Our Singapore Malay Corpus consists of about 100 conversations and narratives (spontaneous, planned 
and elicited), counting about 62,000 words. 

3The 2015 census reveals that English-Malay bilinguals make up 86.2% and 14.0% of the Malay and Indian res- 
ident population, respectively, and that Malay remains the dominant home language of the Malay resident 
population aged 5 years and over (78.4%) (Singapore Department of Statistics 2015). 
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2.2 Participants 


Our participants are all Singapore Malays from diverse linguistic backgrounds. JUR, ISM, 
and ISH grew up in monolingual Malay families, only beginning their English stud- 
ies when they entered primary school at the age of seven. AM, YAN, and SI grew up 
in Malay-dominant bilingual families. While their exposure to English was earlier, all 
three attended Malay-speaking kindergartens, and YAN and SI went on to private reli- 
gious schools, where the medium of instruction was Malay and English. MIZ grew up in 
an English-dominant bilingual family, while LO, HZ and NZ came from families where 
bilingualism was more balanced. Their formal education in English and Malay also be- 
gan in kindergarten. After kindergarten, AM, MIZ, LQ, HZ, and NZ went through the 
mainstream Singapore education system, where the medium of instruction was English. 


2.3 Getting the Story Straight (San Roque et al. 2012) 


The first stimuli collection is a graphic mini-novel depicting in 16 pictures the transfor- 
mation of a man, through a descent into jail caused a change of heart, from someone 
who drinks and beats his wife into a loving father and husband, as shown in Figure 2. 


As 


Figure 2: Getting the Story Straight storyline (San Roque et al. 2012) 


In the original set-up (see text 1 in Table 1), the pictures were presented in a stipulated 
sequence to two participants who negotiated and constructed the narrative. When fin- 
ished, they presented it to a newly arrived third participant. The entire experiment lasted 
about 20 minutes. The word counts offer a measure of the verbal effort with the second 
set-up, when the correct picture sequence is presented to a speaker who narrates it. No 
negotiation took place, since the second participant was instructed to take on the role of 
the listener. The task lasted only about five minutes on average, and required much less 
verbal effort (see texts 2—11). 


2.4 Pear Story (Chafe 1980) 


The second stimuli set is the Pear Story, a six-minute film. Set in the countryside, it depicts 
a loose sequence of events happening around an orchard, where a farmer is picking pears. 
A man walks by with a goat, and a boy on the bicycle comes to collect the fruit. When 
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he later falls and the load of pears spills on the road, three other boys come to his help, 
who each receive a pear in return. We recorded two versions. 


2.5 Frog Story (Mayer 1969) 


Frog Story is a picture book for children (see Figure 3) widely used for language compar- 
ison. It is the story of a boy whose pet frog escaped from its jar, so he sets out with his 
dog to find it. We recorded two versions of this story, listed in Table 3. 


4 q^ D; E d 
W ma NA + | beet at 


Figure 3: Frog Story storyline (Mayer 1969) 


2.6 Jackal and Crow (Carroll et al. 2011) 


Jackal and Crow consists of nine pictures presenting a version of the famous Aesop fable 
of The Fox and the Crow. The fox is drawn to be identifiable as a jackal, wolf, or dog, and 
the crow holds a fish, instead of cheese. 


Figure 4: Jackal and Crow storyline (Carroll et al. 2011) 


We again used two set-ups. The 2013 version is a narration of the picture sequence by 
a single speaker, while the 2014 version follows the original guidelines of Carroll et al. 
(2011) and is a negotiation of two speakers, who construct the narrative for a third par- 
ticipant. 
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Table 1: Collected versions of Getting the Story Straight (San Roque et al. 2012) 


text name words 
1. 2014.MLZ.GettingTheStoryStraight 2235 
2. 2017.NI.GettingIheStoryStraight.]UR 247 
3. 2017.NI.GettingTheStoryStraight.ISM ^ 249 
4. 2017.NI.GettingTheStoryStraight.MIZ 572 
5. 2017.NIGettingIheStoryStraight.AM 355 
6. 2017.NI.GettingTheStoryStraight YAN 437 
2 2017.NI.GettingTheStoryStraight.SI 512 
8. 2017.NI.GettingTheStoryStraight.ISH 239 
9. 2017.NIGettingIheStoryStraight.LO > 214 
10. 2017.NIGettingIheStoryStraight.HZ 380 
11. 2017.NIGettingIheStoryStraight.NZ 227 
Table 2: Collected versions of Pear Story (Chafe 1980) 
text name words 
L 2013.CA.PearStory 169 
2013.LN.PearStory 442 
Table 3: Collected versions of Frog Story (Mayer 1969) 
text name words 
3. 2013.0G.FrogStory 963 
4, 2013.SS.FrogStory 387 


Table 4: Collected versions of Jackal and Crow (Carroll et al. 2011) 


text name words 
1. 2013.0G JackalAndCrow 212 
2. 2014.MLZ.JackalAndCrow 660 
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3 Stance and referent categorisation 


In 81, we linked stance to the notion of referent categorisation. Referent categorisation 
refers to the choice a speaker makes by identifying the referents for the hearer. A funda- 
mental dichotomy exists between proper names and descriptions (nominal expressions). 

Categorisation with proper names positions the speaker as familiar with the object 
of stance. According to Sacks & Schegloff (2007), proper names satisfy two discourse- 
organisational preferences: (i) recognitional preference and (ii) minimised reference. For 
the first, it is easier to work out the reference to something familiar, even if familiar- 
ity is only constructed. The second is a preference for a stable, and perhaps a single, 
reference form, so that the expression-referent pair is stable. Proper names meet both 
requirements, but nominal expressions require more recognitional effort on the part of 
the hearer. In addition to speaker's stance, the choice of a proper name reveals aspects of 
speaker's identity, such as their relation to the topic and their self-positioning within the 
community (cf. Baresová 2016: 13). In a constructed narrative, the identity is symbolic. 

Categorisation with descriptions reveals the speaker's neutral epistemic stance.’ The 
hearer’s effort is greater in both recognition and maintaining reference, as will be appar- 
ent in 84. 

The effect of referent categorisation on discourse structure is most obvious among 
the eleven versions of Getting the Story Straight (see $3.1), and is also detected in the 
Frog Story (see 83.2). In contrast, the referent categorisation in the Pear Story is uniform. 
For categorisation of inanimate referents, their discourse role is the most important fac- 
tor. Discourse-persistent referents require more elaborate descriptions than "props" (see 


83.3). 


3.1 Human referent categorisation in Getting the Story Straight 


The main participants in Getting the Story Straight are: a farmer, his wife, child, and 
friends. The farmer is present in all frames, while the other characters play a less central 
role, sometimes restricted to a single frame. 

More than half of our subjects categorise the farmer with a proper name (usually a 
common Malay name such as Adam, Halim, Samad, or Zamri), making it a referential 
pivot for other human referents (farmer's family and friends). This strategy is in line with 
the preferences for person reference formulated in Sacks & Schegloff (2007: 24). Proper 
names are prototypical and ideal recognitional devices (Sacks & Schegloff 2007: 25) and 
their use is therefore referentially effective. The RefLex scheme classifies the first use of 
proper names as r-unused-unknown (Riester & Baumann 2017: 10). Example (1) illustrates 
that to track the given referent (RefLex r-given) proper names can be repeated. 


(1) Singapore Malay (2017.S1.12-14) 
[Abu] kedengaran me-racau-racau ber-tanda dia sudah mabuk. [Abu] 
PN audible Av-talk.incoherently Av-sign 3sG already drunk PN 


^The term descriptions is synonymous with nominal expressions. 
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mula ber-cerita yang bukan-bukan. Ini lazim ber-laku apabila [Abu] 
start Av-tell REL nonsense PROX common Av-happen when PN 
mabuk kerana minum minuman keras itu. 

drunk because consume alcohol DIST 


"Abu was heard to rave, which was a sign that he was drunk. Abu started to tell 
untrue stories. This habitually happened when Abu was drunk from drinking 


alcohol: 


The neutral epistemic stance leads to the categorisation of the farmer with a nominal 
expression as a petani ‘farmer’ (RefLex r-new).? Because the referent will persist in dis- 
course, it is typically introduced with a classifier phrase (cf. Hopper 1986: 317). We will 
return to this point in the discussion of example (6) and again in $3.3. 

The farmer's family and friends are always introduced through expressions of their 
relationship to the farmer, such as isteri=nya ‘his wife’ in (2). The RefLex scheme charac- 
terises such expressions as r-bridging-contained (Riester & Baumann 2017: 9). The bridg- 
ing containment is realised by possessive constructions available in Malay. It is interest- 
ing that when the farmer is given a name, his wife is usually given one too (e.g. Alia, 
Hawa, Huda, Laila), as in (2)° 


(2) Singapore Malay (2017.MIZ.01) 
kedua-dua Zamri dan [isterinya Alina] ber-kerja seperti pekebun. 
both PN and wife-3POSS PN Av-work as farmer 


‘Both Zamri and his wife Alina work as farmers. 


The categorisation of the child seems independent of the speaker’s stance towards the 
farmer and his wife. In our data, the child is rarely categorised with a proper name. 
In several versions, although depicted in Frame 2 as held by her mother, the child is 
introduced only in the domestic violence scene, as in (3). 


(3) Singapore Malay (2017. YAN.12) 
sambil men-dukung [anaknya ], Huda mem-beritahu Halim bahawa dia tidak 
while Av-hold child-3poss PN Av-tell PN COMP 3SG not 
mem-punyai apa-apa hubungan sama sekali dengan Khalid ... dengan Leyman. 
Av-have any relationship at.all with PN with PN 
‘While carrying her child, Huda told Halim that she did not have any 
relationship with Khalid [sic], ...with Leyman’? 


5A wealth of literature is dedicated to various aspects of the Malay noun phrase. The relevant devices are (i) 
classifiers (Hopper 1986; Chung 2000; 2008; Cleary-Kemp 2007; Chung 2010; Salehuddin & Winskel 2012), 
(ii) demonstratives (Himmelmann 1996; Williams 2009), (iii) relative clauses (Cole & Hermon 2005), (iv) the 
linker yang (van Minde 2008), and (v) the definite -nya (Rubin 2010). 

The bridging anaphor between the possessive -nya and its target Zamri is highlighted using the RefLex 
scheme convention, i.e. the target of the anaphora is underlined and the referential expression is in square 
brackets. 


48 


2 Stance, categorisation, and information structure in Malay 


Plurality is an important feature of human referents: the farmer's friends are always 
introduced in a reduplicated form as kawan-kawan 'friends'. The possessive =nya may 
associate them with the topical farmer. In some versions, the gossiping friend is named 
(e.g. Rashid, Wahid). Both strategies are combined in AM's version, where the friends 
are first introduced as a group in an earlier sentence, and then the gossiper is named as 
Rashid, as shown in (4). 


(4) Singapore Malay (2017.AM.08) 
[Rashid] menceritakan bahawa dia pernah ter-nampak isteri Pak Samad telah 
PN AV.tell COMP 3SGonce INVOL-see wife Mr PN already 
meng-gatal dengan Encik Romi semasa dia sedang mem-beli barang rumah di 
Av-chat.up with sir PN when 3sGPROG Av-buy item home in 
pasar. 
market 


“Rashid told everyone that he had seen Pak Samad's wife flirting with Mr. Romi 
while she was buying household items at the market’ 


The old man, who sees the fight between the farmer and his wife (Figure 2, frame 5), is 
usually categorised as a relative (usually as the father of the spouse) using a possessed 
noun, as in (5). This is a type of r-bridging-contained, where the possessor is already 
known from the context. The introduction is abrupt, because the old man calls the police 
right away, so there is no time or need to provide any other details. 


(5) Singapore Malay (2017.AM.14) 
[Papa Laila ] ter-nampak perkara ini lalu me-lapor-nya, lalu 
father PN INVOL-see event PROX then Av-report-3 then 
me-lapor-kan-nya ke polis. 
Av-report-APPL-3 to police 


“Laila s father saw the incident and reported it, reported it to the police? 


The neutral stance leads to a nominal categorisation of the old man as a neighbour, us- 
ing an enumerated classifier phrase, as in (6).7 In the RefLex Scheme, such a referent 
is classified as r-new (Riester & Baumann 2017: 11). It should be noted that the neutral 
stance to the old man does not exclude a strong stance to the farmer and his wife, whom 
SI categorises with proper names. 


(6) Singapore Malay (2017.81.23) 
kebetulan kejadian tersebut di-lihat oleh [se-orang jiran]. 
coincidentally event mentioned pv-see by one-CL.HUMAN neighbor 


‘Coincidentally, the incident was seen by a neighbour. 


THopper (1986) described the role and use of classifiers and provided parameters conducive to the use of 
classifiers based on written nineteenth-century Malay (pp. 313-314). According to Hopper, the primary 
function of classifiers is to grant discourse-new nouns prominence and the ability to become topics, whose 
referents are "individuated" and “persistent in discourse" (p. 319). 
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Categorisation of the policemen and court officials is fairly uniform, using various types 
of nominals. Bare nouns such as polis ‘police’, or a group compound pihak polis ‘police 
force’ are the most common? Within the RefLex scheme, the referent is classified as 
r-unused-known, because we assume that it is generally known and that appeal can be 
made to the local security force to stop violence. The only case where an indefinite de- 
scription (quantified classifier phrase) is used is shown in (7). This may be a consequence 
of enumeration, which in Malay requires a classifier phrase. 


(7) Singapore Malay (2017.SI.24) 
tidak lama selepas itu, [dua orang polis] datang dan mem-berkas Adam. 
not long thereafter two CL.HUMAN police come and Av-arrest PN 


‘Not long afterwards, two policemen came and arrested Adam: 


Proper names open up a referent-internal perspective: for example, the abuse by the 
farmer can be presented from the perspective of his wife or the court, and their stance can 
be constructed. This is shown in (8), where the farmer, introduced as Adam, is referred to 
as suami-nya ‘her husband’, embedding him in a kinship relation with expected norms 
of behavior. HZ's version uses the same strategy to mark the wife's perspective in the 
same point of the narrative (see Table 6). The speaker can establish and/or maintain 
differential perspective to the same referent in this way (cf. Enfield 2007: 107). 


(8) Singapore Malay (2017.SI.25) 
Semasa Hawa di-panggil untuk mem-buat kenyataan di balai polis, Hawa 
when PN pv-call to Av-make statement in station police PN 
kelihatan teruk di-cederakan oleh [suami-nya] sehingga mata, kepala 
appearance dreadful pv-injure.caus by husband-3poss so.that eye head 
dan leher-nya perlu di-balut. 
and neck-3Poss need pv-dress.wound 


"While Hawa was called in to make a statement at the police station, she seemed 
badly hurt by her husband, to the point where her eyes, head, and neck had to be 
bandaged’ 


3.2 Categorisation of human referents in Frog Story and Pear Story 


The two versions of Frog Story show a similar pattern as Getting the Story Straight. A 
stronger epistemic stance leads to categorisation of the boy with a proper name. The 
stronger stance allows the speaker to fabulate the boy’s character, emotions, and habits, 
as in (9), where the dog is described as the boy’s anjing kesayangan ‘beloved dog’, and 
the frog is expressed with a possessive phrase (r-bridging-contained). 


SThe root pihak is used in other group compounds such as pihak berkuasa ‘authority, agency’, pihak lawan 
‘opposition’, pihak musuh ‘enemy, enemies’, and pihak pengurusan ‘management’. 
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(9) Singapore Malay (2013.SS.FrogStory.01) 
Pada suatu malam sebelum tidur [Abu] dan [anjing kesayangan dia ] sedang 
on one night before sleepPpN  anddog beloved | 3s6 PROG 
me-renung [kataknya ]. 
AV-study frog-3Poss 


‘One night before sleeping Abu and his beloved dog were watching his frog: 
In an inverted manner, the speaker's neutral epistemic stance is reflected in a categorisa- 
tion with descriptions. In (10), both the boy and his dog are categorised with indefinite 


nominals (RefLex Scheme: r-new). The friendship between the dog and the boy is con- 
structed later, and is not included in the first description of the dog. 


(10) Singapore Malay (2013.0G.FrogStory.01) 


Pada suatu hari, ada  [se-orang anak kecil], budak lelaki, yang mem-punyai 
on one day exist one-CL.HUMAN child small boy REL AV-own 
[se-ekor anjing] sebagai teman-nya. 

one-CL.ANIMAL dog as friend-3poss 


“Once, there was a little boy, who had a dog as his friend? 


Both available versions of the Pear Story contain no proper names. New human referents 
(r-new) are categorised with enumerated classifier phrases and inanimates with bare 


nouns.? 


3.3 Categorisation of non-human referents 


Let us now turn to the Jackal and Crow texts, which describe a simple plot without 
human referents.'? Neither of the texts uses proper names; instead, participants are in- 
troduced into the discourse with enumerated classifier phrases (RefLex r-new), as in (11). 
The fragment also contains two presentational clauses, headed by the verb terdapat 'ex- 
ist, be attested in the world, be found’. Vague quantification with beberapa ‘several, few’, 
or with reduplicated plural forms such as ikan-ikan '(a variety of) fish' does not require 
a classifier. 


(11) Singapore Malay (2013.MLZ JackalCrow.140—141) 
Pada zaman dahulu, terdapat [se-ekor burung gagak]. Dah beliau 
in past exist one-CL.ANIMAL crow already 38G.HON 
ternampak beberapa bakul yang terdapat ikan-ikan. 
INVOL-see few basket REL exist ^ RED-fish 


‘Once upon a time, there was a crow. And it saw several baskets filled with fish’ 


?Our findings agree with those reported by Sukamto (2013), who studied written narratives of the Pear Story 
in Indonesian. 

1? As mentioned in $2.6, two set-ups were used to collect the two texts. For the analysis of the MLZ version, 
we are only concerned with the final summary of the story given to the third participant. 
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'The second text shows the same pattern. Animate non-human referents are categorised 
as descriptions, expressed with a classifier phrase, if the referent will become a topic. In 
(12), the referent burung gagak ‘crow’ is introduced as the subject of an inverted existen- 
tial clause headed by ada 'exist'. The inversion puts the focus on the predicate (Sneddon 
et al. 2012: 270). The subject is quantified (the numeral se- + the classifier ekor (animate)), 
as well as the object of the relative clause (beberapa ‘several’). A similar use of classifiers 
and quantification in introducing new referents is reported in Hopper (1986: 319) for the 
nineteenth-century written autobiography known as Hikayat Abdullah. 


(12) Singapore Malay (2013.0G JackalCrow.02) 


Pada satu hari ada [se-ekor burung gagak] yang men-jumpai beberapa 
on one day exist one-CL.ANIMAL crow REL AV-discover several 
bakul ikan. 

basket fish 


“Once, there was a crow that found several baskets of fish’ 


Although the fish is already mentioned as the content of the basket, as introduced in (12), 
this does not grant the fish the status of given information. It requires an upgrade from 
being a ‘prop’ to become a discourse-persistent referent (cf. Hopper 1986: 319). Analogous 
to other discourse-persistent referents, the single fish, which is to be picked up by the 
crow, is introduced with a classifier phrase, as in (13). 


(13) Singapore Malay (2013.0G JackalCrow.03) 
Jadi burung gagak itu meng-ambil [se-ekor ikan] untuk jadi bahan 
SO crow DST AV-pick one-CL.ANIMAL fish to become matter 
makan-nya untuk hari itu. 
food-3Poss for day DIST 


‘So the crow took a fish (OR one fish) as its meal for the day? 


It is interesting to note that the tree, on which the crow lands, is not mentioned at all in 
the second version. In the first version, its expression is unusual, requiring a placeholder, 
suggesting retrieval problems, as in (14). After the correct label is retrieved, it is realised 
as an N-DEM structure, with the reduced proximate ni, requiring resolution in the physical 
context. The RefLex scheme classifies such referents as r-environment. This is, however, 
a non-standard solution in the context of the narrative. 


(14) Singapore Malay (2013.MLZ JackalCrow.147) 
Dia LAND kat ker... apa nt... pokok ni. 
3sG csland at PART what PROX tree PROX 


‘It landed on what..., what's this..., on this tree? 
The above examples illustrate what Hopper (1986: 313) refers to as props. Event settings 
are described with bare nouns, which are occasionally enumerated, or reduplicated (be- 


berapa bakul ikan, ikan-ikan). Props are easily omitted where the context and world 
knowledge enable the hearer to construct them regardless. 
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To summarise, the speaker epistemic stance is most apparent in the categorisation 
of humans. A stronger epistemic stance leads to the use of proper names for the key 
characters. We will show in $4 that the tracking of such characters is simpler than of 
those humans categorised as nominals. For non-human participants, the speaker's epis- 
temic stance is less relevant than what Hopper (1986: 319) termed as persistence in the 
discourse. Future topics are introduced in a more elaborate way (typically with a classifier 
phrase) than props. Incidental props have only short persistence and require no tracking 
(cf. Hopper 1986: 320). Table 5 summarises the effects of stance and discourse role on the 
categorisation and expression of referents in our Malay corpus. We should keep in mind 
that elaborate descriptions can combine a nominal and a proper name, as in (2). For the 
sake of our hierarchy postulated here, we consider the proper name to be an indication 
of the speaker's stronger epistemic stance. 


Table 5: Effect of stance and discourse role on referent categorisation 


EPISTEMIC STANCE 


REFERENT CATEGORY STRONG NEUTRAL 
+human proper name classifier phrase 
+animate ? classifier phrase 
-animate, +discourse-persistent classifier phrase 
-animate, -discourse-persistent bare noun 


4 Categorisation and referent tracking 


Many referents persist in discourse for some time (Hopper 1986: 317) and dedicated con- 
structions indicate their status as given (RefLex r-given). In this section, we show that 
the initial stance and categorisation have global consequences for referent tracking. 

Human referents categorised with proper names, discussed in $3, are tracked with 
proper names and pronouns, usually dia and ia. Particles, demonstratives and other 
markers are used rarely. In contrast, human referents categorised with nominals are 
tracked in a more elaborate way, requiring a greater effort from the hearer. A range of 
devices are used, including repetition, and synecdoche; marking with demonstratives, 
particles, or relative clauses are all common ways of tracking. 

Figure 5 and Figure 6 visualise the categorisation and tracking of referents in two 
quite distinct versions of Getting the Story Straight. The expressions are time-aligned as 
they appear in the story.!! Continuous lines mean that the referent is not only discourse- 
persistent but also topical. Whenever the line is interrupted, another topical referent 
appears. Two lines coincide when a reference is made to more than one referent, either 


"The following abbreviations are used in Figure 5 and 6 and the tables in the remainder of this chapter: cr 
classifier, N noun, NUM numeral, PN proper name, POSS possessor, PRO pronoun, and RED reduplication. 
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with plural pronouns (mereka ‘they’), or with possessive constructions (indexing both 
the possessed and the possessor). 

Figure 5 illustrates the minimisation of reference: proper names are systematically 
followed by pronouns (cf. Heritage 2007: 260), but other devices are not used. As the 
narrative shifts, a proper name is used to activate the referent and the pronoun tracking 
it within the local macro-event, usually corresponding to a single picture. In two places, 


the speaker used synecdoche (N[N]), which corresponds to the blue line dropping to the 
bottom of the chart. 


8 


ball A. TOLU 


e Farmer 4 Wife + Child x Friends e Police 


PN harvest drinks ij anger VA. AE court WA sad freehome united quit drinking 
PRO VE WAAZ =< V 

N-POSS 

RED-N 

RED-N-POSS 


NUM-CL-N arrest release 
N . 


N[N]-POSS 
NIN] 


Figure 5: Storyline visualisation of the referential devices in SI version of Get- 
ting the Story Straight 


Figure 6 shows that the referents are introduced with a classifier phrase (NUM-CL-N) or 
with a possessor phrase (N-Poss), and are tracked almost without exception with pro- 


nouns. Particles pun, pula and the demonstrative tersebut are used to reactivate a given 
referent as a topic. 


2a ie TJ ig a ën 


e Farmer a Wife + Child < Friends e Police 


PN 


PRO home gossip o Ames court jail sad release free home F united o. 
Leen 
N-POSS fight / roh quit.drinking 
N-POSS-pula Ë Ë 
RED-N-POSS 
NUM-CL-N 1°" 
N 
N-pun S 
N-tersebut dips Kee Le 


Figure 6: Storyline visualisation of the referential devices in HZ version of 
Getting the Story Straight 
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Detailed discussion of the patterns visualised in Figure 5 and Figure 6 and those at- 
tested in other texts follow. Demonstratives and the particles pun, pula, and lagi will 
be treated in $5. We are only concerned with the nominal expression of referents; zero 
anaphora, word order alternations, and verbal morphology will be discussed elsewhere. 


4.1 Tracking of human referents 


The most common way to track human referents is with pronouns, followed by repeti- 
tion and synecdoche. The main characters of Getting the Story Straight (the farmer and 
his wife), regardless of their categorisation as familiar or unfamiliar, can be tracked by 
pronouns. Repetition of the proper name or the nominal expression is also common. In 
a complex sentence, proper names are restricted to the first mention and tracked with 
personal pronouns in subsequent positions, such as dia and -nya in (15). 


(15) Singapore Malay (2017.AM.09) 
er. [Zamri] sangat marah dengan pengetahuan ini, dan [dia], dan [dia] 


HESIT PN very angry by knowledge PRoxandj3sG and 3sc 
telus... terus balik rumah untuk marah isteri [-nya]. 
immediately return home to scold wife-3Poss 


"Zamri was very angry upon receiving this information, and he immediately 
went back home to scold his wife’ 


Multiple named referents are tracked with the plural mereka, as in (16). 


(16) Singapore Malay (2017. YAN.01) 
Seperti hari-hari biasa, Halim bersama isteri-nya Huda akan ke kebun 
as everyday accustomed PN together wife-3Poss PN will to garden 
mereka untuk memetik buah-buah labu yang telah pun masak. 
3PL to Av.pick mED-fruit pumpkin REL already ADD ripe 


'As on a normal day, Halim and his wife Huda would go to their garden to pick 
pumpkins that had ripened” 


Referents categorised with proper names are tracked with pronouns, even where an- 
other topic is present. This is the case in (17), where the speaker comments on the loss 
of appetite experienced by the farmer in jail. The farmer, called Halim in this version 
(see (16)), is tracked with the possessive -nya to background his experiencer role and to 


highlight his experience. 


(17) Singapore Malay (2017 YAN.22) 
Selera [-nya] juga ter-ganggu dan [dia] tidak dapat 
appetite-3POss also ` Arr-upset and 3sG not manage Av-finish 
meng-habiskan makanan yang ` di-berikan. 
food REL PV-give 


“His appetite was affected and he couldn't finish the food he was given: 
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Topical kinship terms, such as isteri-nya “his wife' in (18), are tracked with pronouns. 
Interestingly, the proper name Jack in the complement clause cannot become the an- 
tecedent of dia. This suggests that Malay anaphoric pronouns target the local topic, or 
that embedded proper names are not felicitous as antecedents for pronouns. 


(18) Singapore Malay (2017.LQ.07) 
Isteri-nya  ber-kata yang Jack salah faham ` dan [dia] setakat beli 
wife-3POss AV-say COMP PN misunderstand and 3sc so.far buy 
barang-barang pasaran sahaja. 
things market only 
‘His wife said that Jack misunderstood and that she only bought goods from the 
market: 


As mentioned in the discussion of Figure 5, proper names are tracked with pronouns 
where one description is a paraphrase of an earlier one, or follows from it in a logical 
Way, as in (19). 


(19) Singapore Malay (2017. YAN.30-31) 
[Halim] mem-beritahu berapa seksa-nya hidup di dalam penjara. 
PN Av-tell how.much torturous-INTENs life inside prison 
[Dia] me-luahkan rasa kesal [-nya] di atas perbuatan ganas 
35G Av-express feeling repent-3Poss at action violence-3Poss past.time 
[-nya] tempoh hari akibat mabuk me-minum minuman keras. 
in.result drunk AV-drink alcohol 
‘Halim told them what a torment life in prison was. He expressed his feelings of 
regret over his brutal actions a few days ago, the result of being drunk from 
drinking liquor. 


In (20), a single macro-event in three sentences characterises the farmer's ordeal in jail. 
The proper name Adam is used only in the first sentence, and tracked with dia '3sc* 
subsequently. 


(20) Singapore Malay (2017.SI.29-31) 
Sewaktu di dalam penjara, [Adam] tidak henti-henti menangis. [Dia] tidak 


while inside jail PN not RED-stop AV.cry 3SG not 
lalu untuk makan. [Dia] hanya duduk menangis di dalam penjara yang gelap 
happento eat 3sG only sit av.ry inside jail REL dark 


lagi berbau itu. 
ADD smell DIST 


"While in jail, Adam did not stop crying. He could not eat. He only sat crying in 
the jail that was dark and smelly: 


We have shown that proper names are tracked with pronouns, and that their repetition 
creates a rhythm of sub-events. This strategy applies to the main character, the farmer. 
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A somewhat different strategy is used to track the farmer's wife and child. Apart from 
repetition, particularly common is synecdoche (RefLex r-given); the farmer's wife and 
child are referred to as keluarga ‘family’, as in (21). In another version, the farmer's status 
is characterised as berumah-tangga ‘married, having a family’, or the couple is referred 
to with suami-isteri. 


(21) Singapore Malay (2017.AM.26) 
Pak Samad cuba mengeratkan hubungan-nya dengan [keluarga-nya]. 
Mr PN try Av.strengthen relationship-3Poss with ` family-3Poss 


"Pak Samad tried to improve his relationship with his family? 


In one version of Frog Story, the boy is categorised with a proper name and tracked by 
repetition and pronouns (see Table 8). His dog and frog are categorised and tracked with 
possessives, highlighting their relationship to the boy. 


(22) Singapore Malay (2013.SS.FrogStory.11) 
Kemudian Abu dan [anjingnya | memanggil-manggil [kataknya ] lalu 
subsequently PN and dog-3POss RED-call frog-3Poss through 
tingkap [biliknya ]. 
window room-3Poss 
“Ihen Abu and his dog called repeatedly for his frog through the window of his 


room. 


Categorisation with descriptions reflects the speaker's neutral epistemic stance (see also 
83). Tracking of such referents is more elaborate and besides repetition, pronouns and 
zero anaphora also include demonstratives and particles (see also 85). 

Another tracking strategy involves relativisation. Unlike demonstratives or particles, 
however, relativisation can embed another perspective. In the final scenes of Pear Story, 
the farmer is puzzled by seeing three boys walking by with his pears. The farmer is 
not aware that the boy who took one of his baskets shared the fruit with these boys 
when they helped him to pick up the scattered fruit. LN resolved this by constructing 
the three boys as new, taking up the farmer's perspective, as in (23). Notice that the 
noun pear combines with tadi, conforming to the all-knowing perspective of the speaker- 
storyteller. 


(23) Singapore Malay (2013.LN.PearStory.31) 
Kemudian dia ter-nampak tiga, [tiga orang budak yang sudah 
subsequently 3sG INvor-notice three three CL.HUMAN child REL already 
me-makan pear tadi]. 
AV-eat pear previous 


"Ihen he saw three..., three boys who were eating the pears from earlier on? 


In CA's version in (24), the passing boys are presented as the subject of an inverted 
intransitive clause and modified by a relative clause referring to the pears received for 


their help. 
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(24) Singapore Malay (2013.CA.PearStory.13) 
<unclear> datang [tiga budak] yang mem-bantu budak yang <unclear> makan 
arrive three boy REL Av-help boy REL eat 
buah yang di-petik. 
fruit REL pv-pick 
‘[While he was thinking], coming there were three boys who helped the [fallen] 
boy, eating the harvested fruit: 


Relative clauses are utilised in Pear Story to distinguish between the children (boy, girl, 
and the three boys). In the case of the boy, reference is made to his fall, as in (25). The 
false start with code-switching may reveal the decision-making of the speaker as to how 
to most effectively categorise the boy, i.e. with reference to boys who helped, or with ref- 
erence to the fall. In general, the more elaborate descriptions distinguishing the children 
confirm our point about the simplifying effect of strong epistemic stance on referent 
categorisation and tracking. 


(25) Singapore Malay (2013.LN.PearStory.25) 
Budak yang CHILDRE... [budak yang ter-jatuh tadi itu pun] terus 
boy REL cs.children boy REL INVOL-fall RECENT DIST EVENT direct 
menunggang basikal-nya kembali. 
AV.ride bicycle-3poss back 


“The boy that children..., the boy who fell just now continued riding his bicycle? 


Table 6 summarises the expressions of the participants in Getting the Story Straight, in 
the order in which they appear in the narrative (the first mention is underlined). The 
horizontal line divides the texts into two groups according to the epistemic stance, taking 
the categorisation of the farmer as a criterion. Above the line are seven texts where the 
farmer (and usually also his wife) is categorised with a proper name. In the remaining 
four texts, the speaker's neutral epistemic stance is apparent in the categorisation of the 
farmer with a description. In contrast, categorisation of the police and friends is more 
uniform. 

The most significant patterns, discernible in the table, are the following: (i) the speaker's 
strong epistemic stance is reflected in the categorisation of prominent referents with 
proper names; (ii) repetition and pronouns are used to track them; (iii) neutral stance 
leads to categorisation with descriptions, typically an enumerated classifier phrase; (iv) 
various types of NPs (including demonstratives and particles) and pronouns are used 
for tracking; and (v) the farmer's wife, child and friends are referred to with possessive 
phrases. 


4.2 Tracking animate and inanimate referents 


Tracking of non-human referents (animate and inanimate) shows a split pattern, a more 
diverse one to track animates, especially when fabulated as capable of inner speech 
(thoughts, plans, or emotions). Inanimate referents, on the other hand, are rarely tracked 
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beyond their first introduction. Discourse-persistent inanimate referents are tracked 
with various types of NPs, but never with pronouns. Table 7 and Table 8 summarise 
the tracking devices in Frog Story and Jackal and Crow. 


Table 7: Categorisation and tracking of referents in Jackal and Crow 


version crow basket fish jackal tree 
MLZ NUM-CL-N QUANT-N-RC RED-N N ^? N-DEM 
pech, PRO(*pun)* 
N(* pun), N, 
the+n@ N+itu 
OG NUM-CL-N QUANT-N N NUM-CL-N n.a. 
pRO, PRO, 
N[N]° N N 
N[N]+itu N+itu N+itu(+pun) 


"bare noun is followed by the additive focus particle pula in (43), but we analyse it as scoping over the entire 
clause 

Pboth the honorific beliau and default dia are used 

ĉa range of Ist, 2nd and 3rd person pronouns are used 

4codeswitching is used: the gagak ‘the crow’ 

*N[N]: the crow is referred to as burung gagak ‘crow’, or as burung ‘bird’ 


Table 7 shows that referent categorisation and tracking in Jackal and Crow is quite uni- 
form. A single referent, the tree in which the crow perches, is completely omitted by OG. 
In contrast, the two versions of Frog Story display the same epistemic stance variation 
as the Getting the Story Straight texts, as shown in Table 8. 

We now turn to the tracking of non-human referents. For animate referents, repetition 
and pronouns are common. In (26), the frog (katak Abu Abus frog’) is tracked with 
the possessive -nya, partly because the frog is fabulated as an experiencer (capable of 
emotion), and thus the description of the boy and dog is consistent with the frog's per- 
spective. 


(26) Singapore Malay (2013.SS.FrogStory.54) 
Nampaknya [katak Abu] sangat merindui kawan-kawannya |. 
apparently frog PN very miss RED-friend-3P0ss 


"Apparently Abu's frog missed his friends: 


In (27), personal pronouns dia and beliau (honorific) refer to the crow, where the hon- 
orific is a clue of speaker's sarcasm. 


(27) Singapore Malay (2013.MLZ JackalAndCrow.141-2) 
Dah [beliau] ter-nampak beberapa bakul yang terdapat ikan-ikan. SO [dia] 
already 3SG.HON INVOL-see few basket REL exist ^ RED-fish cs.so 3SG 
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Table 8: Categorisation and tracking of referents in Frog Story 


version boy dog frog jar forest bees rodent hole? 
OG NUM-CL-N NUM-CL-N NUM-CL-N NUM-CL-N | NUM-CL-N | N-RC NUM-CL-N | NUM-CL-N 
PRO, PRO, PRO, PRO, 
N-POSS N-POSS, N-POSS N-POSS 
N[N]G+ itu) Ntitu(+pun)  N«itu n[N]+(itu) — w(*itu) N(+itu) N[N](+itu) pp? 
SS PN N-POSS N-POSS N-POSS N N-RC n.a. NUM-N-PP 
PN, PRO(+pun), PRO(+pun) 
(RED-)N-POSS (RED-)N-POSS N-POSS N-POSS RED-N 
N+itu N+itu N+itu 
owl rock branch antlers deer water log frogs 
OG NUM-CL-N NUM-N NUM-N-RC N-POSS NUM-CL-N — N-RC NUM-N ^ N-Poss 4 
PRO, PRO, PRO, N[N], 
N(* itu) N+itu N+itu N+itu N N+itu NUM-CL-N+itu 
SS NUM-CL-N NUM-N NUM-N-RC ©  N-POSS N nf N-RC N-POSS 
NUM-CL-N 
N+itu N+itu N+itu N+itu N+itu N N[N](+itu) 


“the cavity occupied by the owl 

bpp: prepositional phrase: the hole referred to as di dalam ‘inside’ 

“the constituent could be interpreted as a compound satu akar pokok ‘a tree root’ or a possessive construc- 
tion satu akar pokok ‘a root of the tree’ 

"realised as a possessor in bunyi-bunyi katak ‘frog sounds’ and treated as given thereafter 

“realised as satu ranting pokok yang... "a tree branch that ..’ 

frealised together with antlears as tanduk rusa ‘deer antlers’ 


pergiah, terbang, terbang, terbang, [dia] pergi dekat ikan tu, zoop! 

go PART fly fly fly 3sG go near fish DIST INTER 

“And he (the crow) saw several baskets filled with fish. So he went, flew, he came 
to the fish and went zoop? 


The same version contains a mini-dialogue, shown in (28), where the first and second per- 
son pronouns refer to the jackal and crow, respectively. The follow-up comment where 
the speaker praises his own story-telling performance is another clue of the speaker’s 
sarcasm. 


(28) Singapore Malay (2013.MLZ JackalAndCrow.164) 
[Saya] nak dengar suara [awak] yang merdu lah. Chey, macam betul aja. 
1SG want hear voice 2sG REL sweet PARTINTER, kind genuine PART 


‘I want to hear your sweet voice, hey that kind of sounds just right’ 


A summary of the categorisation and tracking of non-human given referents in Jackal 
and Crow and Frog Story is given in Table 7 and Table 8. The common pattern is the 
limited variation in the description of inanimates, and the lack of tracking thereof. When 
tracked, the default is to include the demonstrative itu, which will be discussed in §5.1. 
Finally, Table 9 shows that the categorisation and tracking of referents in Pear Story is 
fairly uniform. Neither speaker takes a strong epistemic stance, with the result that the 
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tracking of the children is quite elaborate to distinguish the boy with the basket, from 
the girl on the bike and the boys who help him pick up spilled fruit. 


Table 9: Categorisation and tracking of referents in Pear Story 


version farmer fruit baskets shepherd boy bicycle girl 3 boys 
CA NUM-CL-N , N NUM-N^ na. NUM-CL-N , N n.a. NUM-N P 
PRO, N^ fadi PRO, N* itu PRO, N(+NUM) 
si-N* itu 
LN NUM-CL-N , N N-PP^ N-POSS ,  NUM-CL-N,PRO, N NUM-CL-N | NUM-CI-N 4, PRO+pun 
PRO, N-* itu, PRO N- lagi^tadi* itu PRO+tadi, NUM-CL-N 
N+tadi RED-N-POSS 
“NUM-N: quantified noun, e.g. tiga bakul ‘three baskets’ 
PNUM-N: enumerated noun phrase tiga lagi budak kanak-kanak ‘three small boys’ 
^N-PP: noun with a locative prepositional phrase locating the baskets in relation to the tree 
d. 


realised as tiga orang dak laki ‘three boys’ 


The effect of stance and discourse role on the tracking of referents is summarised in 
Table 10, whose structure parallels that of Table 5 above. 


Table 10: Effect of stance and discourse role on referent tracking 


EPISTEMIC STANCE 


REFERENT CATEGORY STRONG NEUTRAL 
+human PN, PRO PRO, N(+itu/PART) 
+animate PRO(+PART), N-POSS/+itu PRO, N(+itu/PART) 
-animate, +discourse-persistent PRO, N-POSS/+itu N(+itu) 
-animate, -discourse-persistent n.a. 


5 Maintaining joint attention 


In the previous two sections we discussed the role of stance for referent categorisation 
and tracking. This section focuses on another aspect of interaction and balancing of infor- 
mation disparity. This interactive aspect is part of Du Bois' stance model, conceptualised 
as the alignment between the interlocutors (cf. Du Bois 2007: 171). As Du Bois puts it: I 
evaluate something, and thereby position myself, and thereby align with you (2007: 163). 
Du Bois' alignment falls within a larger notion of joint attention, which is a type of so- 
cial cognition (cf. Tomasello 1995). Diessel (2006) applies the notion of joint attention to 
demonstratives, whose primary roles he identifies in (i) locating referents relative to the 
deictic centre, and (ii) coordinating the interlocutors' joint attention (Diessel 2006: 469). 

While demonstratives are certainly the most prominent joint-attention coordinating 
devices in Malay (cf. Himmelmann 1996; Williams 2009), Malay possesses a number of 
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adnominal markers with a similar function, most importantly pun and pula. In this sec- 
tion, we analyse the use of Malay demonstratives and other adnominal markers in re- 
lation to coordination of interlocutors' joint attention and show how the use of these 
devices is related to epistemic stance and referent categorisation. The data suggests that 
neutral stance and nominal expression of referents correlate with the use of demonstra- 
tives. By taking a neutral stance, the speaker expects greater recognitional effort on the 
side of the hearer and compensates by providing more clues so that joint attention can 
be maintained. We will demonstrate that these clues are demonstratives and particles. 
While key characters of the story do not require the use of these clues frequently, they 
are used whenever a more peripheral participant becomes a topic. 


5.1 Malay demonstratives 


Malay demonstratives (both long and short forms) may introduce new information and 
track "persistent" referents throughout discourse (Himmelmann 1996: 241). The use of 
demonstratives has implications for how the perspective of the hearer is constructed in 
interaction, as either having or lacking access to the intended referent (Williams 2009), 
and indicates the speaker's stance. Our discussion of the data again follows the referen- 
tial hierarchy, starting with human referents. 

In the following fragment, the farmer is tracked with a demonstrative phrase. Such 
use is common in texts where the referent was categorised with a description. The 
speaker prevents a possible misalignment with the hearer by using the demonstrative 
and putting focus on the farmer, as affected by polis ‘police’, the local topical agent, which 
moves the plot. 


(29) Singapore Malay (2017.JUR.18) 
Polis tiba | dan menangkap [petani itu]. 
police arrive and av.seize farmer DIST 


"Ihe police arrived and caught the farmer: 


Another example is given in (30), which immediately follows (50). The distal itu puts the 
gardener in focus, and constructs the child's perspective as not recognising her father. 
The distal form does not have any spatial meaning here, because the man has just arrived 
in the scene. Instead, it creates an emotional distance, and marks the stance of the child. 
It locates the responsibility for the non-recognition within the child, and ultimately in 
the abusive behaviour of her father. 


(30) Singapore Malay (2017.ISM.21) 
Anak-nya tidak kenal kepada [pekebun itu ]. 
child-3Poss not know to gardener DIST 


‘His child did not recognise the gardener: 


12 An overview of all the expressions of key referents in Getting the Story Straight can be found in Table 6. 

Williams (2009) describes a similar use of demonstratives in Indonesian conversation. Djenar (2014) shows 
that nih and tuh have presentative, directive and expressive functions, and explains why tuh is used for 
recognitional and discourse deixis. 
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'The most common way to track discourse persistent non-human referents is with N+itu. 
The demonstrative has a similar function as the English definite article, marking the 
given referential status of the referent. The distal form does not imply any contrast or 
any spatial relation, and its function is purely referential in marking the given referent 
and perhaps aids the hearer in identifying the referent. We do maintain the gloss DIST 
in (31), but a gloss civ for given would be equally plausible. 


(31) Singapore Malay (2013.0G.FrogStory.03) 
Umm. Dan mereka ber-main bersama-sama tiap-tiap malam, di mana [anjing 
HESIT and 3PL ` Av-play RED-together RED-each night where dog 
itu] akan tidur di bawah [katil [budak lelaki itu]], sementara [katak itu] akan 
DIST will sleep below bed boy DIST while frog vist will 
tidur di dalam peti gelas-nya. 
sleep inside ` glass.jar-3Poss 
‘Mmm. They played together every night and the dog would sleep under the 
boy's bed while the frog slept in its jar’ 


The above characterisation of itu as a definite marker is further supported by the code- 
switching patterns. Speakers of Colloquial Singaporean Malay frequently code-switch 
in English across genres. Consider now (32), where the NP contains the English definite 
article the, where one would expect itu. The English then corresponds to the eventive 
pun, which will be discussed in $5.4. 


(32) Singapore Malay (2013.MLZ JackalAndCrow.167) 
THEN THE  gagak, THEN THE  gagak nyanyi 
cs.then CS.DEF crow  CS.THEN CS.DEF crow sing 


‘Then the raven sang: 


The following two examples from Frog Story are a pair, where (33) shows the categorisa- 
tion of a pair of adult frogs in the final episode of the story. A description consisting of 
a possessive construction presents the frogs indirectly as “emitters” of the sound. 


(33) Singapore Malay (2013.0G.FrogStory.30) 
Selepas itu, mereka um, jalan ke satuah, lagi satu uh, akaruh, pokok ya, 
thereafter 3PL ^ PART move to one HESIT other one PART root PART tree yes 
dan budak lelaki itu suruh anjing-nya diam, kerana dia men-dengar ah, 
and boy DIST ask dog-3Poss silent because 3sG Av-hear ^ HESIT 
[bunyi-bunyi katak ] di belakang mm, dahan pokok itu ya. 
RED-noise frog behind PART branch trunk ost yes 


"After that, they walked to another tree root, and the boy instructed his dog to be 
quiet because he heard frog noises behind the tree trunk: 


Subsequently, the frogs are tracked with w- itu (RefLex r-given). 
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(34) Singapore Malay (2013.OG.FrogStory.31) 
Jadi dengan senyap, mereka dekat berhampiran dengan um, [katak itu], dan, 


so with silence 3PL ` near adjacent with PART frog DIST and 
akhirnya mereka jumpa dua ekor katak di belakang um, pokok itu, ya. 
finally 3PL find two CL.ANIMAL frog behind PART tree DIST yes 


‘So with silence, they approached close to the frog and finally they met two frogs 
behind the tree. ' 


The proximal ini is used less frequently and does entail that the referent is spatially prox- 
imate. The viewpoint from which the proximity is constructed can shift and be located 
within the participants. In (35), the boy's perspective is taken to refer to the frogs, as 
well as to the relative temporal ini ‘now’, located within the story. 


(35) Singapore Malay (2013.0G.FrogStory.33) 
Jadi mm, selepas budak lelaki itu, ber-cerita-kan kepada kedua, uh, 
SO PART subsequently boy DIST AV-tell-APPL to couple PART 
[ibu dan bapa katak ini ], bahawa ia mahu mem-bawa balik, uh katak 
mother and father frog PROX COMP 3sG wish Av-carry return PART frog 
yang sebelum ini berada di rumah-nya. 
REL previously PROX be in house-3POSS 


'So after the boy explained to both the father and mother frog that he wanted to 
bring back that frog that before this was in his house’ 


Apart from the spatial ini and itu, there are three more deictic forms which do not have 
spatial uses, but are common in discourse: tadi ‘recently mentioned’, tersebut ‘aforemen- 
tioned’, and si ‘familiar’, which will be described below. Their use correlates with a neu- 
tral epistemic stance and categorisation with descriptions, except for si, which expresses 
familiarity and therefore marks a stronger epistemic stance. 

The demonstrative tadi “recently mentioned' is a dedicated anaphoric form derived 
from an adverbial meaning ‘earlier’ (Sneddon et al. 2012: 133). It is likely grammaticalised 
to the adnominal position through a yang modifier construction: N yang tadi » N tadi. 
In one version of the Pear Story, it tracks the farmer picking fruit. The example given in 
(36) is beautiful, because it verbalises the intention behind using tadi in the preceding 
phrase kita patah balik... ‘let us return back’. 


(36) Singapore Malay (2013.CA.PearStory.11) 
Jadi bila, kita patah balik kepada [perkebun tadi ], masa dia turun 


so when 1PL.INCL turn.back to farmer RECENT time 3sG descend 
daripada pokok dia nampak tadi, dia nampak agak aneh kerana sebab 
from tree 3sG see recently 3sc see slightly weird because reason 


masa dia naik ada tiga bakul. 
time 3sG climb be three basket 


‘So back to the farmer from earlier, the time he came down from the tree he 
found it weird as he last saw three baskets. 
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The anaphoric demonstrative tersebut ‘aforementioned, that’ is used with expressions 
referring to the farmer in Getting the Story Straight. Singapore Malay speakers base some 
of their stylistic preferences on their formal education; the use of particles and of the 
demonstrative tersebut strikes native speakers as formal and rote-like. In NZ version, 
where tersebut is used more than in all the other texts combined, the particle is used to 
track the farmer, his friends and the police (see Table 6). Apart from tracking, tersebut 
puts the focus on the given referent. We gloss it as GIVEN.FOC and translate it with the 
English that, which can also have a focusing role. Its use correlates with the neutral 
epistemic stance and categorisation of referents with descriptions. Its extensive use by 
NZ is illustrated in (37). We believe that the frequent use is a personal characteristic of 
NZ, rather than a general pattern. 


(37) Singapore Malay (2017.NZ.05-08) 
Dalam kemarahannya itu dia pun menumbuk isterinya. Se-orang 
in anger-3POSS DIST 3SG EVENT AV.punch wife-3POSS one-CL.HUMAN 
orang tua yang ter-lihat [kejadian tersebut ] pun, uh, | mmm, memanggil 
oldman REL INVOL-see event ^ GIVEN.FOC TOP2 HESIT HESIT AV.call 
polis dan [polis tersebut pun] uhhh, menangkap [lelaki tersebut ]. Di 
police and police GIVEN.FOC EVENT HESIT AV.catch man GIVEN.FOC in 
balai polis pula, uh, isteri-nya pun mem-beritahu keterangan tentang 
station police then HESIT wife-3POSS TOP2 AV-report testimony about 
[kejadian tersebut | kenapa ia terjadi. Suami-nya bun takut dengan, 
event  GIVEN.FOC why 3sG happen husband-3poss Topz afraid with 
aah, apa yang akan menjadi terhadapnya. 
HESIT what REL will happen about-3 


‘In his anger he then punched his wife. An old man who saw that incident then 
called the police and those police then caught that man. Then at the police 
station, his wife explained that incident and why it had happened. Her husband 
then got frightened over what would happen to him’ 


In our corpus, the demonstrative si is used sparsely. Traditional grammars attribute si a 
diminutive function (Sneddon et al. 2012: 146) and report that it is never used in address 
terms, but only in reference to somebody who is not the hearer (Sneddon et al. 2012: 
374). The Wiktionary entry for si contains an accurate characterisation; in addition to 
‘friendly connotation’, ‘diminutive’, and ‘friendly categorisation’, it also lists “generic 
categorisation’, exemplified in (38). 


(38) Indonesian (Wiktionary.si#Indonesian) 
[Si ayah] harus belajar mengenal [si anak]. 
father must learn Av.know child 


“The father has to learn to know the child’ 


“Note that the NP seorang tua yang...pun combines with pun, while newly introduced into discourse, but 
immediately cast as topic. The particle pun seems to work in tandem with tersebut, where one marks the 
new topic and the other links explicitly the relevant given referent. 
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We propose that si is a marker of familiarity, restricted to human referents. It is an ex- 
pression of a strong epistemic stance.’ Si draws interlocutors’ attention to a referent 
by presenting it as familiar, i.e. identifiable within one's knowledge, or recent discourse. 
Tracking proper names with a si phrase follows the triangular pattern of person refer- 
ence identified in (Haviland 2007: 229-230), where a new referent is anchored in relation 
to both the speaker and the hearer. The si phrase is an explicit anchoring effort in rela- 
tion to the familiar knowledge of the hearer. Within the stance framework proposed by 
Du Bois (2007), it is also an alignment device which explicitly interacts with the inter- 
locutors' perspective. 

It is not relevant that the familiarity is only constructed as such, because existing 
familiar referents are identified in exactly the same way, as we will show in (41). In 
(39), a discourse-recent referent marked with si is presented. Sukamto (2013) observes a 
similar pattern in written Indonesian accounts of Pear Story. Expressions re-activating 
the given participants tend to be highly specified, and combine with both si and sang in 
the Indonesian texts. 


(39) Singapore Malay (2013.CA.PearStory.05) 
Semasa dia memetik buah dia atas, ada se-orang budak me-naiki 
when 3sG Av.pick fruit 3sc above exist one-CL.HUMAN boy ` Av-travel.by 
basikal dan dia ter-nampak buah di dalam bakul itu lalu dia memikir 
bicycle and 3sc INvot-notice fruit inside basket pisr then 3sG Av.think.about 
harus-kah dia meng-ambil tetapi memandangkan [si perkebun itu] 
need-Q.PART 3sG Av-take but considering FAMILIAR farmer DIST 
begitu perihatin dengan memetik buah di atas pokok lalu dia meng-ambil satu 
so concerned with ^ Av.pick fruit on.top tree then 3sG Av-take one 
bakul tampa  izin. 
basket without approval 


"When he picked the fruits above, a boy riding a bicycle saw fruits in the basket. 
Then he thought whether he should take some, but considering that our farmer 
was so concerned with picking fruits, he actually took one whole basket without 
permission’ 


In our Singapore Malay corpus, si is used invariably to refer to relatives, partners or 
friends who do not take part in the interaction. The fragment in (40) is taken from an 
interview with an elderly speaker of Singapore Malay, who describes here how she got 
engaged. Her future father-in-law used to ask her, whether she had yet found a mata-air 
‘beloved’ and whether she liked his son (absent during the exchange). 


The notion of familiarity is defined by Gundel et al. (1993: 278) as a special cognitive status where the 
hearer already has a representation in memory, either in long-term memory, absence of recent mention, 
or in short-term memory, if he has. 
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(40) Singapore Malay (2016.BandarGirls.202) 
Kau suka tak, dengan si... Arsyad? 
2sG like not with FAMILIAR PN 


‘Do you like [our] Arsyad, don’t you?’ 


In (41), a mother asks whether her son, who is preparing for a math exam, is finished 
with his tutor (absent during the exchange). This is the first mention of the tutor in the 
conversation, and later in that same conversation, the tutor is tracked with dia. 


(41) Singapore Malay (2013.SNS.Exam.17) 
Abeh tak belajar eh? Dah habis [si dia tu] ajar dah habis? 
then not study Q.PART already finish FAMILIAR 3SG DIST teach already finish 


"Why don't you prepare anymore? It is done what he [the tutor] taught you?’ 


In our narrative corpus, human referents categorised with proper names may be accom- 
panied by an appositive si phrase. In (42), the vegetable seller is constructed as familiar to 
the farmer, amplifying the effect of the accusation and explaining the rage that follows.'® 


(42) Singapore Malay (2017.51.15) 
sewaktu sedang mabuk Abu mem-beritahu Adam bahawa dia ter-nampak Hawa, 
while proc drunk PN av-tell PN COMP  3SGINVOL-see PN 
isteri Adam, sedang ber-mesra-mesra bersama [Sani, si penjual sayur] di 
wife PN PROG AV-RED-cozy together PN ART seller vegetable in 
pasar. 
market 


"While he was drunk, Abu told Adam that he saw Hawa, Adam's wife, behaving 
in a friendly way with Sani, [you know] the vegetable seller at the market: 


In summary, Malay si interacts with a specific layer of the hearer's memory: either with 
the recent memory, or with personal knowledge and stereotypes. Marking unknown and 
unfamiliar referents with si is a request for cooperation to either fill out the speaker's 
intention, and accept the referent in a common ground (in statements), or to supply the 
relevant knowledge in the next turn (in questions). It is the ultimate device forcing joint 
attention." 

We will now proceed to discuss the Malay particles pula, lagi, and pun, whose function 
in manipulation of joint attention is even more complex than that of the demonstratives 


discussed here. 


léThe man introduced in the drunk gossip (see Figure 2, frame 4), is sometimes given a name, such as Encik 
Romi in (4), or is referred to with a proper name followed by a nominal marked with si, such as Leyman, si 
penjual surat khabar 'Leyman, the news agent’. 

"There are some interesting parallels with other markers of familiarity, such as the New Zealand y'know 
(Stubbe & Holmes 1995: 69), the Abui hearer-oriented forms (Kratochvíl & Delpada 2015), or the more 
grammaticalised systems of engagement (Evans et al. 2017a,b). 
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5.2 Particle pula 


The Malay particle pula (colloquial pulak) is traditionally characterised as an additive fo- 
cus particle (Sneddon et al. 2012: 236). Nomoto (2017: pula(k)) distinguishes between two 
functions of the Malay pula: (i) when placed after the predicate, the particle indexes the 
speaker's epistemic stance — the situation is marked as not conforming to the speaker's 
expectation, as surprising, or as evoking doubts; (ii) when combined with nominals, pula 
encodes contrast, but also interacts with expectation. 

Both (43) and (44) employ the additive pula when the jackal is categorised with a de- 
scription.'? In (43), the additive pulak marks the existence of the newly introduced jackal 
as a somewhat unexpected addition to the discourse. The speaker perhaps contradicts 
the reasonable anticipation of the bird eating the fish, so the appearance of a hungry 
jackal presents an unexpected twist in the story.? After all, the fable is well-known, and 
it is reasonable to expect that the hearer is familiar with the plot. 


(43) Singapore Malay (2013.MLZ.JackalCrow.149) 
Dah | LAND, dia alih-alih ni ada [musang] pulak dia nampak. 
already cs.land 3s6 suddenly prox exist jackal ADD 38G see 


‘And as it landed, the crow suddenly saw that there was also a fox (there). 


In the second text, the jackal is introduced as the subject of an inverted existential clause 
with an enumerated classifier structure in (44). The jackal is linked to the already known 
crow with the relative clause, where the crow is the object of the involuntary action 
verb terlihat ‘happen to see’. The additive pula marks the newly introduced location, 
effectively extending the space in which the narrative is constructed. In terms of joint 
attention, the particle forces an update. It constructs the extension of the space in which 
the story takes place as unexpected or surprising. 


(44) Singapore Malay (2013.0G JackalCrow.04) 
Di satu ladang pula ada [se-ekor serigala] yang terlihat burung gagak 
in one field ADD exist one-CL.ANIMAL jackal REL spot crow 
itu terbang bersama ikan dalam mulut burung itu. 
DST fly together fish inside beak bird — per 


‘In a field, there was a jackal that saw the crow flying with the fish in its mouth: 


Example (45) shows the contrastive function of pula, where the benefit of the police 
action for the farmer's wife has to be considered in parallel with the punishment of her 


husband. 


18Note that there are several additive markers in Malay. Forker (2016: 91) discusses only pun as additive, while 
Goddard (2001: 27) calls both pun and pula emphatic. Sneddon et al. (2012: 236) considers both juga and 
pula additive markers, which indicate that "the focused part is an addition". 

P Malay speakers in Singapore are taught in Malay language composition classes that the particles pula and 
pun make their style “more interesting" or "engaging", and mark the "climax". We believe that at least 
in some cases, Malay speakers may be using these particles for such "aesthetic" reasons. The aesthetic 
function of pun, as a marker of a particular style is also discussed by Cumming (1991: 107). 
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(45) 


Singapore Malay (2017.HZ.09-10) 

Dengan itu, dia harus pergi ke, uh, pihak polis dan beritahu tentang apa yang 
therefore 3sG must go to HESIT police andinform about what REL 
terjadi. Um, [suami-nya pula ] berasa amat menyesal akan apa 
happen HEsIT husband-3Poss con.Foc feel very Av.regretful about what 
terjadi, dan beliau | amat risau tentang, um, apa yang akan terjadi 
happen and 3sc.HON very uneasy about HESIT what REL will happen 
kepada-nya iaitu, um, beliau harus di- di- di-letakkan di dalam lockup dan 
to-3 namely HESIT 3SG.HON must PV-place inside jail and 
di-belasah oleh pihak polis. 

pv-beat.up by police 

‘So now she had to go to the police and tell them what happened. Her husband 
(on the other hand) felt very regretful about what had happened, and he was 
very worried about what would happen to him, that is, he had to be detained in a 
jail cell and beaten by the police: 


The particle pula also occurs with left-dislocated locative elements. Its function appears 
to be to move the narrative along to another location. We have seen one example of this 
use in (44) and give another in (46) below. 


(46) 


Singapore Malay (2017.NZ.07) 

[Di balai polis pula ], uh,  isteri-nya pun mem-beritahu keterangan 
in station police ADD HESIT wife-3POSS TOP2 AV-report testimony 
tentang kejadian tersebut kenapa ia terjadi. 

about event GIVEN.Foc why 3sGhappen 

"Ihen at the police station, his wife explained that incident and why it had 
happened: 


By using pula, the speaker proposes a broadening or update of joint attention. In this 


function pula is similar to the demonstratives discussed in $5.1, because the "field" of 


joint attention remains essentially the same. In the next section we will discuss the use 


of lagi, another additive particle, whose use seems to be more restricted, but allows for 
scope manipulation. 


5.3 Particle lagi 


The particle lagi indicates repetition with predicates, but with adnominal quantifiers, it 
has an additive function. The additive function is illustrated in (47), where the particle 
highlights that the reference is made to all members of the group (Forker 2016: 84-85). 


(47) 
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Singapore Malay (2013.CA.PearStory.08) 
Dalam perjalanan pulang budak itu dengan tidak sengaja ter-langgar batu 
while drive home child  pisr with accident INVOL-hit rock 


lalu dibantu oleh [tiga lagi budak kanak-kanak] untuk mengumpulkan, 
then pv-helpby three more boy child to av.collect 
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mem-bangunkan basikal-nya dan buah-buahan yang ter-jatuh. 
AV-putupright bicycle-3Poss and fruit REL INVOL-fall 


‘On the way home, the boy accidentally bumped into stones and is assisted by 
three other young children to collect the bicycle and fallen fruits’ 


The additive lagi also creates a relationship with the boy, who is the topic of the sen- 
tence. Within the RefLex scheme, this referent is classified as r-new, but the presence 
of the additive marker suggests that this may be a referential type, not distinguished by 
the RefLex Scheme. In terms of joint attention manipulation, lagi emphasises the exis- 
tence of another referent which should be included in the focus. Additives are known 
to be scope sensitive (Forker 2016: 72). In (47), the additive marker follows the quanti- 
fier, highlighting the precise quantity of the added referents. In the next section we will 
discuss the use of pun, which essentially marks a proposal for a joint attention shift. 


5.4 Particle pun 


The particle pun is more frequent than other particles and demonstratives in our Getting 
the Story Straight corpus. This particle has received much attention in the literature, and 
is treated in the greatest detail in Goddard (2001), who provides an exhaustive overview 
of earlier studies (p. 29-30). In our discussion, we adhere to Goddard's analysis. The most 
common use in our data, is the “second-position pun" which highlights the sentence 
topic (Goddard 2001: 31). Cumming (1991: 107) suggests that pun is a resumptive topic 
marker attached to left-dislocated noun phrases. Its distribution is further affected by 
individuation of the referent, its semantic role, its introduction into the discourse, and the 
eventiveness of the description. The first function is well attested in our narratives; pun 
frequently marks a switch in topic as participants take over the agency in moving the 
plot forward. One such sequence is given in (48). 


(48) Singapore Malay (2017.ISM.11-13) 
Bila polis tiba, [pekebun pun ] di-tangkap. Di mahkamah, isteri-nya 
when police arrive gardener TOP2 Pv-catch in court wife-3Poss 
memberi, ah, tahu hakim apa yang telah terjadi. [Pekebun pun ] 
give HESIT know judge what REL already happen gardener TOP2 
di-jatuhkan hukuman penjara. 
Pv-hand.down jail.sentence 


‘When the police came, the farmer was arrested. In court, his wife told the judge 
what had happened. The farmer was then sentenced to a jail term: 


In (49), which follows directly from (21), pun amplifies the eventiveness of the sequence 
(i.e. the progress of the plot). Note that the translation attempts to capture this with the 
English adverb then in both sentences.?? 


? Note also the use of the active voice in both clauses, highlighting their eventiveness (cf. Djenar 2018: this 
volume). 
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(49) Singapore Malay (2017.AM.27-28) 
[Pak Samad pun ] men-ceritakan pengalaman-nya di dalam penjara dan 
Mr PN EVENT AV-tell experience-3POss inside prison and 
men-jelaskan bahawa dia menyesal dengan tindak-laku-nya. [Pak Samad pun ] 
Av-explain COMP  3sc AV.regret with — actions-3POss Mr PN EVENT 
ber-janji dengan anak-nya bahawa dia akan mem-bawa anak-nya ke 
Av-promise with child-3poss comp  3sc will Av-take ^ child-3»oss to 
jalan-jalan keesokan hari. 
walk.around the.following.day 


‘Pak Samad then told the story of his experiences in jail and made it clear that he 
regretted his actions. Pak Samad then promised his child that he would take him 
for a walk the next day. 


A similar instance of pun amplifying the progress of the plot (i.e. eventiveness) is shown 
in (50). In colloquial Singapore English, the particle pun is often translated with then, 
which has the same function in marking the previous event as completed and a new one 
as commencing. 


(50) Singapore Malay (2017.ISM.19-20) 
Dia sangat gembira dapat me-nikmati cahaya matahari. [Pekebun pun ] pulang 
3sc very elated get  Av-enjoy sunlight gardener EVENT return 
ke rumah-nya. 
to house-3Poss 


‘He was very happy that he got to enjoy the sunshine. The farmer then returned 
to his house: 


Goddard (2001: 54) reports that the topic focus function is the most common in his written 
Malay corpus. In our narrative data, the event sequence function is more common. An 
instance of topic focus is given in (51), where the jackal, upon spotting the crow with the 
fish, is constructed as talking to itself. 


(51) Singapore Malay (2013.MLZ JackalAndCrow.151) 
[Aku pun ]lapar ah. 
1sG TOP2 hungry PART 


‘Tam also hungry: 


The presence of resumptive topic resets the reference of the third person pronoun dia 
and ia. In (52), the jackal is referred to as ia, while the fish and crow require nominal ex- 
pressions. The minimisation of the expression of the topic after it has been focused with 
pun resembles the general tendency for minimisation of reference (Heritage 2007; Sacks 
& Schegloff 2007). We take this as a signal that pun indicates a shift of joint attention to 
a new "field", which is accompanied by a reset in the scope of anaphoric devices. 


?!Hiroki Nomoto has suggested to us that perhaps the core function of the particle is to indicate a clause 
relationship between two clauses which are told in their order of occurrence, but the particle has to be 
placed after the subject of the second clause (Nomoto 2017: pun). 
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(52) Singapore Malay (2013.OG.JackalAndCrow.05-6) 
Jadi [serigala itu pun ] ingin me-makan ikan itu kerana [ia] sangat lapar. 
so jackal bist ToP2 wish av-consume fish psr because 3sG very hungry 
Jadi [ia] fikir [ia] mahu ikan yang di dalam mulut burung gagak itu. 
so 3sG think 3sG want fish REL inside mouth crow DST 
"Ihe jackal wanted to eat the fish because it was so hungry. And it thought, it 
wanted the fish in the crow's mouth’ 


Example (53) summarises the outcome for the crow and clearly illustrates the event se- 
quence focus function of pun (cf. Goddard 2001: 38). 


(53) Singapore Malay (2013.MLZ JackalAndCrow.173) 
[Gagak pun] sedih sebab dia kena tipu, bosan. 
crow EVENT sad because 3sG PAss cheat disgusting 


"Ihe crow was sad because it got cheated, disgusting: 


The particle pun does not occur in our texts with inanimates, but this is just a conse- 
quence of the construction of the plot in the narratives which we focus on here. There 
are instances of its use in our Singapore Malay corpus, such as (54), which describes 
the shortage of rice during WWII. Pun here highlights the food shortage as a local topic 
and brings the focus on porridge, lexically tracking the topic beras (RefLex r-given, l- 
accessible-sub). 


(54) Singapore Malay (2016.BIZ.45) 
Memang takde jumpa beras, lah, nanti masak, ah, bikin bubur ke, bikin, 
indeed not find rice PARTlater cook rop make porridge or make 
kalau dapat bubur pun dah bagus lah, sekali-sekali, pun nak taruk 
if get porridge TOP2 already good PART occasionally EVENT MOD put 
keledek, taruk ubi. 
sweet potato put tapioca 
"We couldn't of course find rice, when we cooked porridge for instance, if we got 
porridge it was already very good, once in a while, still we had to add sweet 
potato and tapioca. 


5.5 Demonstratives and particles 


Demonstratives may be followed by the particle pun. An eventive pun can be seen in 
(55). The speaker confuses the plot, and refers to the wife where the husband is meant. 


(55) Singapore Malay (2017.NZ.04) 
Dalam kemarahan, uh, [lelaki itu pun ] pergi, uh, pergi ke suami-nya, 
in anger HESIT male DIST TOP2 go  HESIT go to husband-3Poss 
eh,  keisteri-nya dan marah, dan marah suami-nya kenapa dia berbual 
HESIT to wife-3poss and angry and angry husband-3poss why 3sG converse 
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dengan lelaki lain. 
with male other 


‘In anger, that man then went to his wife and scolded his husband [sic] for 
talking to other men: 


In (56), the farmer is described as suami tu pun. The particle pun prompts the hearer to 
attend to the temporal sequence, while the demonstrative tu places the focus on the same 
referent. The distal may encode the wife's apprehensive stance towards her husband. 


(56) Singapore Malay (2017.NZ.14) 
Dia mem-beritahu tentang, aah, keadaan-nya disitu dan bagaimana dia 
3sG Av-tell about HESIT situation-3POSs there and how 3SG 
insaf | dan rasa kesan terhadap kejadian-nya tersebut. Dari hari 
penitent and feel consequence about ` incident-3Poss GIVEN.Foc from day 
itu, [suamitu pun] tidak me-minum arak lagi dan tidak ber-campur 
DIST spouse DIST EVENT not AV-drink alcohol again and not Av-mix 
dengan kawan-kawan tersebut. 
with —RED-friend ` GIVEN.FOC 


‘He told them about the conditions there and how he regretted and felt the 
effects of that incident. From that day on, the husband did not drink alcohol any 
more and did not mix with those friends: 


Multiple demonstratives can combine within a single description, as in (57), where the 
noun budak laki ‘boy’ is followed by the recent mention tadi and itu. 


(57) Singapore Malay (2013.LN.PearStory.18) 
[Budak lelaki tadi yang menunggang basikal itu] te-nampak se-orang 
boy RECENT REL AV.ride bicycle DIST INVOL-spot one-CL.HUMAN 
(1s) budak perempuan yang juga menaiki basikal yang bertentangan, yang (1s) 
child female REL also Av.travel.by bicycle REL opposite REL 
berjalan bertentangan dengan-nya. 
travel opposite with-3 


"Ihe boy who was riding the bicycle saw a girl who was also riding a bicycle in 
the opposite direction: 


Table 11 sketches the functions of Malay demonstratives and particles in manipulating 
and directing the interlocutors' joint attention. The effect is captured with simple verb 
phrases — a conventionalised terminology remains lacking.” This representation also 
draws on the idea of cognitive states developed in Gundel et al. (1993) but takes the 
attention asymmetry between the interlocutors as a starting point. The effects fall into 
two groups, depending on whether the “field” of joint attention remains the same or 


shifts. 


?"Tomasello (1995) offers a lucid account of the development of social cognition and the ability to manipulate 
joint attention in children. 
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Within the same field, the proximal ini requires a symmetrical manipulation of joint 
attention, while the remaining deictic forms indicate a reorientation of attention on the 
side of the speaker and require a manipulation of the focus on the side of the hearer 
so that joint attention can be renewed. The most forceful reorientation within the same 
field is encoded with the epistemic particle pula(k), which indicates a surprise or novelty 
on the side of the speaker (captured here as "update"). Finally, the particle pun encodes 
a shift of joint attention and entails a reset of anaphora, exemplified in (52). 


Table 11: Joint attention manipulating functions of Malay demonstratives and 


particles 

DEMONSTRATIVE JOINT ATTENTION MANIPULATION 
SPEAKER HEARER 

si bring in focus activate familiar 
ini keep in focus keep in focus 
itu bring in focus access 
tadi bring in focus recall recent 
tersebut bring in focus recall known 
pula(k) update/broaden ` update/broaden 
lagi add in focus add in focus 
pun shift shift 


6 Conclusions 


A systematic comparison of elicited narrative texts organised in a parallel corpus enables 
us to make several points about Malay discourse and information structure: 


e The speaker's stance is reflected in referent categorisation and has consequences 
for referent tracking. 


e The stronger epistemic stance simplifies expression of referents and their tracking, 
confirming the claim by Sacks & Schegloff (2007) that categorisation of humans 
with proper names require less recognitional effort, as shown in $4. 


e The neutral epistemic stance generally motivates referent categorisation with de- 
scriptions, which need to be more elaborate to track referents effectively. 


e Taking a stronger epistemic stance, the speaker can construct and maintain differ- 
ential perspectives on the referents through their categorisation, such as Adam vs. 
her husband, or her father (Stivers et al. 2007). 
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e The variation of expression correlates with the referential status of the referent 
as well. The high referential status allows for tracking with pronouns, but the low 
status disfavours enumeration and classifiers. 


e The topic focus particle pun is preferred with more complete expressions of a third 
person referent, after which the reference can be minimised (zero, dia, ia, etc.), as 
argued by Heritage (2007); Sacks & Schegloff (2007). 


* Both topical and focused participants are eligible for minimisation, but the remain- 
ing referents require a fuller expression, for non-humans typically a N^ itu phrase. 


Future work will focus on the role of word order and verbal morphology as well as 
on the effect of downgrading the role of the hearer to a silent listener, unable to interact 
where joint attention is not achieved (DeLancey 1997). Our parallel corpus contains such 
information in the negotiations preceding the presentation of the agreed narrative. 
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Abbreviations 
1, 2,3 person markers INVOL involuntary agent ter- 
AFF affected HESIT hesitation marker 
APPL applicative MOD modal auxiliary 
AV active voice PASS passive auxiliary kena 
COMP complementizer bahawa, POSS possessive 
yang PROX proximate 
cs code-switching PV passive voice 
DEM demonstrative TOP topic 
DIST distal TOP2 switched topic 
FOC focus (Goddard’s topic focus) 


INTENS intensifier 
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Attention focus and information 
packaging in Vera'a demonstratives 


Stefan Schnell 


Centre of Excellence for the Dynamics of Language & University of Melbourne 


I investigate endophoric uses of demonstratives in the Oceanic language Vera'a (North Van- 
uatu) in relation to their exophoric use, drawing on different types of linguistic data, in par- 
ticular conversations during the building of a house and narrative texts. I find that Vera'a 
demonstratives mark a three-way distinction of speaker-oriented (excluding the addressee) 
versus addressee-oriented (including the speaker) versus distal (excluding both speaker and 
addressee). Seeking to determine the core meaning of the three demonstratives that would 
square with both their exophoric and their endophoric use, I develop the hypothesis that 
considerations of joint attention focus rather than localisation are its central aspect: speaker- 
oriented forms are used to draw the addressee's attention to an entity, addressee-oriented 
forms to maintain attention to an entity, and distal forms to delay attention to an entity. The 
joint attention meaning of demonstratives in endophoric use yields effects in the domain 
of information packaging: speaker-oriented forms are used to introduce new information; 
addressee-oriented forms in specific syntactic contexts yield effects akin to topic and focus; 
distal forms are used to keep track of background information. Hence, attention focus is the 
invariable core meaning of Vera'a demonstratives that accounts for all senses in different 
contexts of use. 


1 Introduction 


The role of demonstratives in discourse has been the focus of a growing body of corpus- 
based research, starting with Himmelmann (1996). Recent work has been undertaken for 
the Australian language Dalabon by Cutfield (2012) and the Oceanic language Logea by 
Dawuda (2009). The central question concerning the endophoric (inner-textual) usage of 
demonstratives is its relationship to their exophoric (deictic) usage. One line of thought 
sees discourse essentially as a metaphorical extension of space (Diessel (1999) and Büh- 
ler (1999[1934]: 121)), with spatial distance being equivalent to anaphoric (antecedent) 
distance, and referent activation. An example of this is Terrill (2001), who examines the 
function of two demonstratives, foia and oia, in discourse from Lavukaleve: she finds 
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that the use of these two forms in endophoric use mirrors their function to mark dis- 
tance distinctions (proximal versus distal) in situational use, so that spatial distance in 
the physical environment corresponds to antecedent distance within discourse, for in- 
stance how many clauses lie between an anaphor and its antecedent within a given text. 
Other authors stress the speaker/addressee orientation of demonstratives, but still see 
them merely as anchor points for marking spatial relations between speaker, referent, 
and addressee. This view is reflected in Margetts (2004). Yet other authors also focus on 
the speaker/addressee orientation of demonstratives, but see the use of demonstratives 
as a reflection of communicative interaction that can override 'objectively' determined 
spatial circumstances (see Enfield 2003 on Lao). In this line of thought, Mosel (2004) 
shows that Samoan speaker-oriented demonstratives are often used to introduce new in- 
formation into discourse, whereas speaker-plus-addressee-oriented demonstratives are 
used for given referents, reflecting the knowledge state of speaker and addressee during 
discourse production/comprehension rather than their location in space relative to each 
other or to the entity referred to. This paper lends further support to this latter view, 
and it links the significance of knowledge states of interlocutors to that of joint atten- 
tion. On this view, demonstratives can be characterised as linguistic devices that serve 
to "coordinate the interlocutors' joint focus of attention" (Diessel 2006: 481). 

In this paper I present findings from a study of the use of demonstratives in the 
Oceanic language Vera'a. I argue that Vera'a demonstratives do not have spatial-local- 
ising but attention-focussing meaning. Typical for an Oceanic language from Melanesia, 
Vera'a has a rich inventory of interrelated demonstrative forms that can be divided into 
three functionally defined sets of forms: 1. speaker-oriented (DEM.SPKR), 2. addressee- 
oriented (DEM.ADDR), and 3. neither speaker- nor addressee-oriented (distal) (DEM.DIST). 
In instances of exophoric reference, these forms do not merely serve the localisation of 
an entity in terms of their distance relative to either speaker, addressee, or both. Rather, 
itis the awareness state of participants in the communicative situation that is crucial for 
their exophoric use, serving to coordinate the attention focus of speaker and addressee 
(Diessel 2006). Thus, addressee-oriented forms are used where speaker and hearer share 
awareness of a referent and attend to it, even when the referent is located in the speaker's 
here-space. Conversely, speaker-oriented forms can be used where a referent is at a large 
distance from both speaker and addressee if the speaker believes the addressee to not 
be aware of or not attending to a referent they want them to attend to. Distal forms are 
used to refer to something that speaker is not focussing their attention on. 

Crucially, the same considerations of attention focus account for the endophoric use 
of demonstratives in narrative texts (Diessel 2006: 476): here, speaker-oriented forms are 
restricted to two cases of Deixis am Phantasma (Bühler 1999[1934]: 121-140), a subtype 
of situational use in Himmelmann (1996: 222) (see $3.3 for a more detailed character- 
isation): 1. announcements of a narrative that is about to be produced by the speaker, 
and 2. cases of direct speech, where the narrated situation is rendered in analogy to the 
physical context of the actual speech situation. These uses are classified as exophoric 
here. In endophoric use, the choice between addressee-oriented and distal forms is sen- 
sitive to referential distance; however, the use of addressee-oriented forms can yield 
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information-packaging effects in specific discourse contexts, often corroborated with 
other structural properties, for instance left-dislocation and emphatic particles. I argue 
here that these information-packaging effects do not constitute an extended meaning 
of demonstratives, but follow straightforwardly from their attention-focussing meaning: 
speaker-oriented forms are used cataphorically to draw the addressee's attention to the 
narrative that is about to ensue, whereas addressee-oriented forms are used anaphori- 
cally, but serve to reinforce joint attention to already shared and activated information. 
This latter function then contributes to information-packaging effects, which are essen- 
tially a matter of pragmatic inference and the semantics of given syntactic and discourse 
contexts. Addressee-oriented forms are thus not simply used to mark the activation of 
a referent, and other forms of reference - typically pronouns - are used for activated 
referents. Distal forms, on the other hand, are used to re-activate a referent. 

The paper unfolds as follows: 82 provides a brief overview of the Vera'a language and 
its speakers, and the corpus data underlying this study. In 83, I first define the class of 
demonstratives in Vera'a, and related demonstrative adverbs derived from them, and 
describe their syntactic distribution. I then turn to the description of exophoric uses, 
followed by endophoric uses. In $4, I discuss the findings with regard to spatial and 
attention-focus accounts of demonstratives. In $5, I conclude the paper, and provide an 
outlook on future research. 


2 Language data and methodological notes 


Vera'a (ISO 639-3: vra) is an Austronesian language of the Oceanic subgroup spoken 
by 500 people on the island of Vanua Lava, the largest of the Banks group of islands 
in North Vanuatu. It belongs to what Kalyan & Francois (to appear) call the “Vanuatu 
linkage’. Vera'a is spoken in close vicinity to Vurés (Malau 2016) and Mwotlap (Francois 
2001), and adult speakers are usually fluent in the former, and often at least understand 
the latter. All speakers of Vera'a above the age of approx. 3 years speak Bislama, the 
English-lexified contact language that has the status of Vanuatu’s ‘national language’. 
Nonetheless, Vera'a is not severely endangered and it is very vital: most children grow 
up (until the age of about 3) monolingually with Vera'a, which is used virtually in all do- 
mains of everyday communication, except for some church services. Immigrants into the 
Vera'a community usually acquire Vera'a, but may continue to use their first language 
with their children in their homes (this concerns mainly women from neighbouring com- 
munities who marry into the Vera'a community) (cf. François 2012). The language is 
endangered only due to the small size of the community, so that a language shift, for in- 
stance to Vurés, could potentially occur within a short time frame if the positive attitude 
towards the language should falter in the future. 

This study draws on different types of text data in the Vera'a language documentation 
that I have been compiling since December 2006 in close collaboration with speakers 
of the language (http://dobes.mpi.nl/projects/vures veraa/). The documentation encom- 
passes recordings of a wide range of communicative events, ranging from narratives — 
produced mainly for the sake of being documented - to public speeches and (very few) ca- 
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sual conversations, which represent the most naturally occurring communicative events. 
Particularly relevant for this study are a set of video recordings documenting the con- 
struction of a larger house (approx. 4 hours), audio recordings of plant (150 descriptions, 
30sec average recording length) and fish (254 descriptions, 1.5min average length) de- 
scriptions, and a collection of narrative texts (65 narratives of different lengths), most 
of which were only audio-recorded. Video recordings exist for a handful of narratives, 
and I will point to specific visual clues where relevant in the discussion to follow. Note, 
however, that for the purposes of this study, neither gestures nor prosodic structures 
were taken into account systematically, and a systematic and sound analysis of these 
two aspects of language use will have to be left for future work on the language. The 
time-aligned annotation of the audio signal in all recordings was segmented into pause 
units. 

A smaller set of 10 narratives from different male and female speakers of different 
ages - some traditional legends and myths, some more similar to (moral) fables in the 
European tradition - contain rich morphosyntactic annotations that serve as the basis 
for some quantitative examination into the use of demonstratives in Vera'a. The texts 
show some variation in length, ranging from 178 to 929 clause units. Table 1 gives an 
overview. 


Table 1: GRAID-annotated corpus of Vera'a narrative texts 


TextID Speaker age text no. of 

initials group type clause units 
ANV AN 18-25 narrative 208 
AS.1 AS 35-50 narrative 224 
GABG GA 35-50 narrative 178 
GAQG GA 35-50 narrative 232 
HHAK HH 18-25 narrative 446 
ISAM IS 50+ narrative 248 
ISWM IS 50+ narrative 608 
Joe JJ 50- narrative 929 
MVBW MV 25-35 narrative 314 
PALA PH 35-50 narrative 402 
Totals 3789 


All of these ten narratives are part of the Vera'a corpus Schnell (2016) within the Multi- 
CAST collection (Haig & Schnell 2016; https://lac.uni-koeln.de/de/multicast/). Like all 
other text corpora in Multi-CAST, the Vera'a texts have been annotated according to 
the GRAID conventions (Haig & Schnell 2014), reading 'Grammatical Relations and Ani- 
macy in Discourse’. This annotation scheme has been designed specifically for purposes 
of research into referentiality, argument structure and discourse structure, as pioneered 
by Wallace Chafe and Talmy Givón, and associates. Essentially, GRAID annotations cou- 
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ple glosses for form (zero, pronoun, full NP) with those for syntactic functions (s, a, p, 
etc.), as well as information on person and animacy. The Vera'a corpus also contains an- 
notation of phrasal sub-constituents, merely indicated by symbols for “word to the right 
of NP head" (<rn>) and “word to the left of NP head" (Im), and likewise for constituent 
words in complex verbal predicates. Also, adjuncts and other clause-level constituents, 
like adverbs and particles, have been annotated as «other. 

In the study of demonstratives reported here, GRAID annotations were used systemat- 
ically together with morphemic glossing: since both layers of annotation are symbolically 
associated (and the respective type has been used for the GRAID tier relative to the mor- 
phemic glossing tier in ELAN), all demonstrative forms can be searched with respect to 
their syntactic function, either as an argument, a clause-level adjunct/particle, or an NP- 
level satellite constituent. Thus, using ELAN's layered search function, one can easily 
determine figures for and all instances of say DEM.ADDR forms occurring on NP level by 
searching for <dem.addr> on the morphemic glossing tier and <rn> on the GRAID tier. It 
was thus possible to determine a total of 759 instances of demonstrative forms in use in 
this corpus, comprising 528 basic demonstratives and 231 demonstrative adverbs. Since 
GRAID triggers basic features of referring expressions, we can also compare quantita- 
tively the use of demonstratives in relation to NPs without demonstratives and other 
forms, in particular zero and pronoun. This latter investigation is, however, beyond the 
scope of this paper, and is planned to be undertaken in the future. 

In this paper, I generally draw on the wider Vera'a documentation, discussing different 
types of data in turn. It will thus be clear during the discussion from which type of 
data a specific example has been drawn. The Vera'a Multi-CAST corpus serves merely 
to establish a quantitative picture of syntactic distributions of different demonstrative 
forms in narrative texts. 


3 Demonstratives in Vera'a 


In this section, I first define demonstratives in Vera'a following Himmelmann (1996) and 
describe the inventory of basic demonstratives and derived demonstrative adverbs in 
83.1, and their syntactic distribution in $3.2. I then turn to their exophoric and endophoric 
uses in 83.3 and $3.4 respectively. My use of these two terms is adopted from Halliday 
& Hasan (1976: 33) in the following way: exophora is a direct relationship between a 
linguistic expression and an extra-linguistic entity; this is essentially the same as deixis. 
Endophoric relations can be defined in analogy to the definition of anaphora, as stated 
by Huang (2000: 1): 


[An anaphora; added by Stefan Schnell] is a relation between two linguistic ele- 
ments, wherein the interpretation of one (called an anaphor) is in some way deter- 
mined by the interpretation of the other (called an antecedent) [...]. 


I use endophoric here as a hyperonym to both anaphora and cataphora, where the latter 
defines a relationship of “carrying forward" rather than backward. Like Huang (2000) 
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for anaphora, I understand endophoric relations as essentially quite vague and general, 
relying to a large degree on pragmatic inference. Thus, in determining whether a refer- 
ring expression is exophoric or endophoric (for instance a definite full noun phrase), 
interlocutors will seek different clues as to whether they require to consider the dis- 
course context of the same text, or whether they should resort directly to the physical 
surroundings of the speech situation in order to establish its reference. Although some 
form types are more common than others in one of the two types of relation, referential 
form classes are generally not specified for their type of reference. For instance, a per- 
sonal pronoun in Vera'a can be endophoric or exophoric, regardless of the fact that the 
former use is possibly more common than the latter. Likewise, demonstratives are not 
per se exophoric. 

Exophoric uses of demonstratives are attested in two different communicative con- 
texts and related recordings, namely in non-narrative contexts (house building videos, 
observations) and in or immediately connected to narratives. In the latter context, they 
come in two different types: 1. the introduction of a narrative, and 2. instances of Deixis 
am Phantasma, in the sense of Bühler 1999[1934]: 131: 


[...] wenn ein Erzáhler den Hórer [...] ins Reich der konstruktiven Phantasie führt 
und ihn dort mit denselben Zeigwórtern traktiert, damit er sehe und hore, was es dort 
zu sehen und zu hören [...] gibt. Nicht mit dem äusseren Auge, Ohr usw., sondern mit 
dem, was man [...] das "innere" oder ” geistige” Auge und Ohr zu nennen pflegt. [[...] 
when the narrator leads the hearer [...] into the realm of constructive imagination 
and treats him to the same deictic words as before so that he may see and hear 
what can be seen and heard there [...]. Not with the external eye, ear, and so on, 
but with what is [...] called the “mind’s” eye or ear [...] (English translation by 
Donald Goodwin and Achim Fraser Eschbach)] 


Although we may expect this special kind of deictic reference within direct speech to 
work somewhat differently from deixis within the actual speech act frame, I take the 
similarities eluded to by Bühler sufficient to treat them basically alike here (but I will 
point out to the reader what type is represented by any given example). Before turning to 
the exophoric and endophoric uses of demonstratives, I shall provide a definition of these 
forms together with their inventory in Vera'a, and outline their syntactic distribution. 


3.1 Definition and inventory of demonstrative forms 


Vera'a distinguishes three basic forms of demonstrative, marking a three-way opposi- 
tion between speaker-oriented (DEM.sPKR), addressee-oriented (DEM.ADDR), and distal 
(DEM.DIST). What is labelled here 'addressee-oriented' is understood as essentially en- 
compassing the sphere of both speaker and addressee, as will become clear below. The 
three forms fall under the definition put forward by Himmelmann (1996: 210), albeit with 
one qualification. According to Himmelmann, demonstratives have the following neces- 
sary criteria: 
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(1) Demonstratives are linguistic expressions that 
1. are in a paradigmatic relationship to elements that serve to locate a referent 
on a distance scale, i.e. proximal, distal, etc. 
2. cannot be used with reference to entities that are uniquely identifiable in 
a culture (or the world), or in so-called associative anaphora, e.g. where an 
anaphor is a part of the whole expressed by the antecedent 


As for criterion 1, in situational use DEM.SPKR forms can be used to identify a referent that 
is located close to the speaker, whereas DEM.ADDR forms can be used where it is further 
away from the speaker, possibly between addressee and speaker. The form DEM.DIST can 
be used for referents even further away from the speaker, possibly at a large distance 
from both speaker and addressee. Examples in 83.3 will illustrate these points. While 
these uses concern the extension of demonstratives, their intentional definition in Vera'a 
does not draw precisely on the distance parameter, as will become clear in the sections 
to follow. Moreover, it seems that the Vera'a facts provide some illustrative evidence that 
the exophoric-situational use is not necessarily basic to the meaning of demonstratives, 
as assumed in earlier work by Diessel 1999: 93. Instead, my findings support the view that 
the meaning of demonstratives is general with respect to exophoric and endophoric uses 
(cf. Himmelmann 1996: 242), and is indeed grounded in considerations of interlocutor 
interaction and ‘attention focus’, as found by Enfield (2003) for Lao (cf. also Diessel 2006). 
Table 2 summarises the inventory of basic and adverbial demonstrative forms. Forms of 
DEM.SPKR and DEM.ADDR alter between a 'plain' form and a form prefixed with a-. The 
latter is quite likely to represent a crystallised combination of the specific locative article 
a and a demonstrative. The formal distinction is relevant insofar as they correlate with 
constraints on syntactic distribution: for instance only the prefixed but not the plain 
forms of basic demonstratives can occur adnominally. Where a form is prefixed, this is 
indicated by .a in the gloss, for instance DEM.sPKR.A for the prefixed speaker-oriented 
base form. 


Table 2: Inventory of Vera'a demonstrative forms 


DEM.ADDR DEM.SPKR DEM.DIST 
PLAIN A-FORM PLAIN A-FORM 
BASIC né ané gén(é) agén(é) é 
MANNER sené asené segén(é) asegén(é) 
TEMPORAL va'ané va'agen 
LOCATIVE (‘e)kéné (‘e)kégén 


Table 2 also lists demonstrative adverbs in addition to basic demonstratives. These are 
derived from the base forms as follows: manner adverbs are derived by adding a prefix se- 
whose exact meaning and origin are unclear at this stage. Temporal adverbs essentially 
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consist of an adverb va'a ‘already’ plus the base form. Locational adverbs are combina- 
tions of the noun 'eke ‘place’ and a basic demonstrative. All three types of adverb can be 
used adnominally and occupy the same phrase-final slot as base forms (compare example 


(3): 


(2 n= gie si -n  gengensi =n sava sene di =m wolwol 
ART- kava or =ART food or =ART what MAN.DEM.ADDR 3SG =TAM1 RED:buy 
ma mé no. (0.6) 
hither DAT 1sG 


‘(He gave me money for the work.] Also kava, food, or such things, he bought for 
me. MVRP.080 


I will be dealing mainly with the use of base form demonstratives in this chapter. How- 
ever, I will occasionally outline particular usage properties of DEM.SPKR and DEM.ADDR 
by reference to adverbs where particularly illustrative examples show that the findings 
presented here apply to the wider demonstrative system. 

As for Himmelmann’s second criterion, the forms in Table 2 are indeed incompatible 
with the first usage context, but a single example is attested that is similar to the binding 
(associative) anaphora example mentioned by Himmelmann (1996: 210): 


(3 qon vowale ruwa mē =n  gunu-ruo duru=m 'ogo ‘ogo vaa-van 
night one PERS HUM:DL DAT =ART spouse-3DL 3DL =TAM1 stay stay RED-go 
-n rene ane ne wotoqtoqo. 
=ART woman DEM.ADDR.A AOR:3SG pregnant 


“One day, (there was) a couple. They stayed and stayed (and after some time) that 
woman got pregnant: ANV.001-003 


In this example, the speaker first introduces the couple, and two clauses later refers to one 
in the couple, the woman, using an adnominal DEM.ADDR. This usage would disqualify 
the forms in question as demonstratives according to Himmelmann's second criterion. 
Thus, if we take this particular use of a DEM.ADDR form as evidence against its status as 
a demonstrative we will probably have to conclude that the language does not possess 
demonstratives at all, given the clear paradigmatic relation of the form with DEM.sPKR 
and DEM.DIST forms. This would in my view not be a plausible analysis, given the univer- 
sality of demonstratives (and hence their expectedness in Vera'a) and the otherwise clear 
indications that the forms in question function like demonstratives in other languages. 
There are two possibilities to account for this ‘abnormal’ use of a demonstrative: firstly, 
we might be dealing simply with a production mistake, given that it is the only example 
in the entire corpus. Though this is somewhat speculative, it is not impossible that speak- 
ers occasionally ‘miscalculate’ the identifiability of referents, believing that there is an 
eligible antecedent expression in the discourse context, although there isn’t (Ariel 1990: 
72). The second possibility is that Himmelmann’s criterion is too narrow, and that Vera'a 
treats associative anaphora differently from other languages. Which of these two possi- 
bilities is more appropriate remains an open question at this point, and is left for future 
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investigations of more, and possibly more varied, discourse data to determine whether 
such uses are more common with some speakers or in some contexts than our current 
data suggests. 


3.2 Syntactic distribution of demonstrative forms 


For base form demonstratives, three syntactic positions and respective functions are 
attested: NP-internal modifier (adnominal use), argument on clause level (pronominal 
use), and clause-level adverbs and particles. Table 3 provides figures for the frequencies 
of uses of DEM.ADDR, DEM.SPKR, and DEM.DIST in all three syntactic contexts, with the 
restriction that DEM.DIST cannot be used pronominally. 


Table 3: Frequencies of basic demonstratives in 10 Vera'a narratives 


DEM.ADDR DEM.SPKR DEM.DIST Totals 


ADN PRO Other ADN PRO other ADN other ADN PRO other 


255 20 127 21 4 10 74 17 350 24 154 


All three forms can be used adnominally, as modifiers in the noun phrase, as shown 
by the following examples (4) and (5): 


(4) ne ene ruwa malmalae =s sagi e ruwa 
AOR:3SG see PERS HUM:DL girl DEM.DIST =SIM sit DEL PERS HUM:DL 
malmala ane so 
girl DEM.SPKR.A QUOT 
‘[...] and then (he) spotted those two girls (who were) sitting (there), and the two 
girls said: ..’ 1.PALA.059-060 
(5) no =k kaka -n  nelno vu' agene 
1SG AOR:1SG story =ART voice spirit DEM.SPKR.A 
‘Iam going to tell this (the following) story: ..? ISWM.001 


The functions of these different forms in adnominal use will be discussed in $3.3 and 
$3.4. Pronominal use is restricted to DEM.ADDR and DEM.sPKR forms. While DEM.ADDR 
demonstratives occur as subjects (ex 6) and predicates (ex 7) in non-verbal clauses, all 
pronominal uses of DEM.sPKR are instances of subjects in non-verbal clauses (ex 8): 


(6) ne -n  tektek mu-n e Qo. 
DEM.ADDR =ART RED:speech POSS.GEN-CS PERS Q. 
"Ihis was Qo"s speech [what Qo' said]: JJQ.306 
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(7) bul wova'al ë di sa nē 
bunch pawpaw DEM.DIST 35G EMPH DEM.ADDR 


‘That bunch of pawpaw (over there), it is exactly that [which we had in mind]’ 
GABG.075 


(8) kamaduo gen. 
IDLEX DEM.SPKR 


“This (is) us. 2.PALA.125 


Pronominal uses of demonstratives are very rare, in particular where DEM.sPKR forms 
are involved. These cases are in fact restricted to contexts of Deixis am Phantasma in 
narrative texts, to be discussed in 83.3. This corroborates the findings from the sample 
presented in Himmelmann (1996: 215) for texts from different languages. 

Similar to other Oceanic languages (Mosel (2004) on Samoan; Margetts (2004) on Sal- 
iba; Francois (2001) on Mwotlap), basic demonstratives in Vera'a can occur on clause 
level in clause-final position, for example: 


(9) e raga ane -k tek so =n too me ma’ ne 
PERS HUM:PL DEM.SPKR.A —AOR:NSG Say QUOT =ART fowl FUT dead DEM.ADDR 
‘And they said: “These chickens will die!” ANV.037 


(10) dirol-k önön '"i ane 
3TL =AOR RED:lie DEL DEM.SPKR.A 


‘They lay down [and shortly after that ...]’ MVBW.109 
(11) e Qo'ne sagrén sag kēkē e 

PERS Q. AOR:3SG sit stuck up above DEM.DIST 

‘But Oo is trapped up (in the tree), have you forgotten?’ JJQ.151 
(12) kamam miir sa — gene le =n wio. 

1PL.EX.TAM1 sleep EMPH DEM.SPKR LOC =ART bamboo 

“Let's sleep right here, in the bamboo: JJQ.354 


In examples (9) and (10), DEM.ADDR forms occur in clause-final position. In (9), it functions 
as an assertive particle that serves to underscore the speaker's conviction about the truth 
of a proposition. In (10), it fulfils a clause-combining function, details of which will be 
discussed in 83.4. The same function is carried out by DEM.DIST in this position (ex 11). 
Basic DEM.SPKR forms always function as locative adverbs when occurring on clause 
level (ex 12). 

Demonstrative adverbs are used mainly adverbially on clause level (category ‘other’ 
in Table 4). However, they do occur - in rare instances - as modifiers in the noun phrase. 
Table 4 lists the relevant figures for frequencies of use. 

I confine myself here to presentation of a single example of a locative adverb in ad- 
nominal function (see also example (2) above): 
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Table 4: Distribution and frequency of demonstrative adverbs in 10 narratives 


DEM.ADDR DEM.SPKR 'TOTALS 


ADN other ADN other ADN other 


MANNER 2 57 3 24 5 81 
TEMPORAL 1 61 0 5 1 66 
LOCATIVE 4 57 1 16 5 73 
TOTALS 7 175 4 45 11 220 
(13) ‘eré ‘ama’ 'ekene dir -ém van se-serge ma. 
PL devil LOC.DEM.ADDR 3PL =TAM1 go RED-together hither 
"Ihe spirits (of) here, they came together here? ISWM.281-282 


In the following subsections, I will focus on the use of basic demonstratives, and only 
occasionally turn to the use of demonstrative adverbs. 

Before we turn our attention to the different uses of demonstratives, it should be noted 
that adnominal demonstratives are the only NP-level element that directly influences the 
referential interpretations of NPs. Pre-head articles are merely phrase markers whose 
only function is the marking of the type of phrase, and the subtype of NP (common 
versus personal). NPs with adnominal demonstratives are definite, and have an identi- 
fiable referent. We will see below that this is merely an entailment of demonstratives, 
and not their core meaning. NPs without demonstratives are likewise regularly used in 
contexts where their referent is identifiable, but they are also compatible with discourse- 
new reference. As for pronominal uses of demonstratives, these may be relatively rare 
in the corpus because third person pronouns are regularly used for anaphoric and - to 
a lesser degree - deictic reference to identifiable entities. As indicated above, this paper 
will be confined to the development of a first hypothesis concerning the use and mean- 
ing of Vera'a demonstratives. A systematic corpus study of the use of demonstratives in 
competition with other types of referring expressions will be left for future work on the 
language. 


3.3 Exophoric use of Vera'a demonstratives 


The exophoric functions of demonstratives are often seen as basic, for instance in Diessel 
(1999). While it is the exophoric use of demonstratives that defines the class as a whole 
in Himmelmann's (1996) definition, it is questionable whether spatial considerations are 
part of the core meaning of demonstratives. Himmelmann himself doubts this, as do do 
studies like Enfield (2003), as well as later work by Diessel (2006). My findings concern- 
ing the use of demonstratives in Vera'a lend further support to the view that the basic 
function of demonstratives is in interlocutor interaction and attention focussing. 
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The purpose of this section is to provide an outline of exophoric uses of the three 
demonstratives in Vera'a. I will here draw on two types of data, namely the house build- 
ing video and observation where speakers point to entities and locations in the physical 
environment of the recording. Secondly, I will illustrate the same patterns of demonstra- 
tive use in instances of Deixis am Phantasma in the direct speech of characters pointing 
to entities within the narrated reality. The last type of exophoric use to be discussed here 
is in a narrative's frame, in particular where a narrative is announced just prior to its 
narration, and where it is wrapped up after the last events have been reported. 

It should be noted here that at this stage, no more systematic investigation of situa- 
tional uses of demonstratives - like the studies by Enfield (2003) on Lao and Margetts 
(2004) on Saliba - has been carried out for Vera'a as yet. This is, however, not strictly 
necessary for understanding their use in narrative discourse, as will become clear be- 
low. The basic distinctions of speaker orientation, addressee orientation, and distal in- 
terpretation is relatively clear from the few examples in the video recordings (the house- 
building recording and a subset of narratives), and is the only one relevant for their use 
in narrative discourse. We will see in some examples here that even in exophoric use, 
information-related categories are more relevant than spatial considerations. 

Vera'a demonstratives are participant-oriented, which does not so much draw on dis- 
tance relative to the speaker, but on considerations as to whether the addressee (or the 
speaker) can be regarded as sharing some here-space with the referent. In this system, 
DEM.SPKR forms are strictly associated with the speaker, as can be seen from the follow- 
ing two examples where the pEM.sPKR form contributes the location of an object near 
the speaker, but outside of the immediate perception of the addressee: 


(14) woqe'enge agene ‘ere mom kal'i [...] 

wood DEM.SPKR.A PL put up DEL 

‘This wood here [in speaker's hand], put (it) up, I have just cut (it): 
HouseBuilding10 FQ.005 


(15 n  gasel luwo mu-m gen 
ART knife big POss.GEN-2SG DEM.SPKR 


"Your bush knife is here [where I am]! observed 


Example (14) is taken from a house building recording. In this situation the speaker is 
holding a piece of wood in his hand and is about to pass it up to someone standing on 
the scaffold, so that it becomes a strut in the house's rafter. In uttering (14), he brings 
this new piece of roof structure to the other person's attention. Examples like (15) can be 
observed daily in the Vera'a community. Here, the speaker is sitting in a sleeping house 
and shouts out to the addressee, who is searching for his knife outside and inside the 
kitchen house. One crucial aspect concerning the use of DEM.SPKR is that it seems to 
entail the speaker's assumption that the addressee is not attending to the entity in ques- 
tion (and is possibly not familiar with it). This is illustrated by the following examples of 
direct speech involving situational use within narratives. Here, the speaker is drawing 
the addressee's attention to something that is not only clearly not in their here-space, but 
is also presented in such a way that the addressee is not aware of it: 
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(169 ruwa =n  no-k en be agene n= bē no-k 
HUM:DL =ART POSS.DOM-1SG ART Water DEM.SPKR.A ART= Water POSS.DOM-1SG 
ane ga mana ne. 
DEM.SPKR.A STAT power DEM.ADDR 


‘Hey you two, I have some water here. This water of mine has magic powers. AS. 
1.100-101 


(17) n qo gene wo =n manra gené no me gis gide -k van. 
ART pig DEM.SPKR and -ART money DEM.SPKR 1SG FUT hold 1PL.IN -AOR go 


‘I have a pig here, and some money here. I'll take it and then let's go. 
ISWM.219 


In both examples, the referent is presented as new to the addressee. Once attention has 
been established in (16), the speaker uses the pEM.ADDR form in the following clause 
which further elaborates on the referent. The data examined thus far lend themselves to 
the following hypothesis: DEM.sPKR forms are used where an entity is only in the here- 
space of the speaker, excluding that of the addressee, and where the addressee is not yet 
attending to that entity. 

By contrast, DEM.ADDR forms are used where the referent is not exclusively in the 
speaker's here-space, for instance where addressee also has perceptual access to it or 
where it is in the addressee's here-space. The following two examples illustrate this: 


(18) o kene në ga  telegleg. 
INTERJ LOC.DEM.ADDR DEM.ADDR STAT askew 
‘No, there, that one is (still) uneven [not horizontal]: 
HouseBuilding04 PB.008 


(19 [..]le riegen nara gasel ane [...] 
take out ART mouth knife DEM.SPKR.A 
“Remove that (blade of that) knife!” HouseBuilding09_JJ.009 


Again, both examples stem from the house building video recordings. In (18), the speaker 
is first looking at the measuring string between two posts on the other side of the house’s 
plane. The string is used to check whether both posts have the same height, and here the 
speaker states that it is not horizontal yet. The adjustment of the string is the main thing 
that is happening at this stage, and the speaker can assume everybody to be attending 
to it. In (19), the speaker - standing outside the scaffold — is referring to the knife lying 
in the middle of the house's plane, surrounded by other builders. One of these in fact 
then approaches the knife, apparently following the request. Everybody in this scene 
can be regarded as being aware of the knife lying there. We can assume that the use of 
a DEM.SPKR form would have triggered the addressees' turning their heads towards the 
speaker, the here-space exclusive to the speaker. 

Equivalent examples can be found in direct speech within narrative texts where both 
characters, the speaker and the addressee, are rendered as being aware of an entity and 
attending to it: 
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(20) ne =n én men né 
DEM.ADDR =ART ART bird DEM.ADDR 
"[discussion among characters after hearing some noise out in the bush] “That 


[what we are hearing here] is a bird, believe me!" (There is a bird singing.)' 
MVBW.061 


(21) o bul | wewe wova'al ne  vowalsarené ga  minene'e sa 
INTERJ friend fruit pawpaw NUM one  up.there STAT ripe DEM.ADDR EMPH 
gidu me gen ne 
1DL.IN FUT eat DEM.ADDR 


‘Oh friend, that one ripe pawpaw fruit up there, that's the one we will eat, I tell 
you. GABG.051-052 


The second example also illustrates the omnipresent potential ambiguity between 
exophoric-situational and endophoric-tracking use: as will become clear below, 
DEM.ADDR forms are also the ones used for given-activated referents - i.e. those referents 
already attended to by both speaker and addressee - and DEM.ADDR forms could here be 
understood as referring to the entity both characters attend to in the depicted physical 
environment, or the referent in the depicted discourse of direct speech. Example (16) il- 
lustrates a similar ambiguity in the second clause, where the magic water just presented 
to the two is taken up again in the second clause. 

The following examples of direct speech illustrate the same functional aspect of 
DEM.ADDR forms, this time used adnominally: 


(22) dir -m galno ma | no =m rem a =n  woqe'enge ane. 
3PL =TAM1 lie 1sc hither 1sG =TAM1 climb Loc.sP =ART tree DEM.SPKR.A 
‘They tricked me into coming here, and I climbed up this tree’ JJQ.182-183 
(23) ‘ei bulsal nik ë dada seveele =n bē nanara ane 
INTERJ friend 2SG AOR:2SG RED:do how Loc =ART trunk tree.sp DEM.SPKR.A 
"Hey, friend, what are you doing on this Nanara trunk here?' GAQG.034 


Summarising the observations concerning the use of DEM.ADDR, the following can be 
stated: 1. the entity in question is not clearly located in the speaker's here-space and is 
not clearly excluded from that of the addressee, and 2. both speaker and addressee attend 
to it, or are at least aware of it. 

Comparing the use of DEM.SPKR and DEM.ADDR forms, we can conclude that the for- 
mer are used where the entity is excluded from the addressee's here-space and is being 
brought to the addressee's attention just at the time of utterance, and the latter is used 
where the converse situation holds, i.e. where an entity is not excluded from the ad- 
dressee's here-space and both speaker and addressee are already attending to it at the 
beginning of the utterance. This hypothesis would be falsified, for instance, by examples 
where a DEM.ADDR form is used for an entity clearly in the here-space of the speaker but 
not the addressee - for instance where both are separated by a wall or so, as in example 
(15) above - or where a DEM.sPKR form is used when clearly both speaker and addressee 
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were already attending to the entity at the beginning of the utterance. Such examples 
have not been identified in the data. 

The decisive factor in the use of these two forms is the inclusion versus exclusion of 
the addressee, rather than the speaker's here-space. This is illustrated by the following 
example where the referent of the NP ‘head post’ is in the speaker's here-space, but 
not excluded from that of the addressee. The speaker here appeals to the fact that the 
addressee - in this case myself behind/next to the camera - is attending to the post, 
pointing the camera at it and him: 


24) di ne 'j-'iris e raga a sag keke a =n  wolwolo 
£ 8 
3SG AOR:3SG RED-even PERS HUM:PL LOC.SP up above LOC.SP =ART cross 
e ne =n  woOqa'ag qii ne no =s  'arar 


DEM.DIST DEM.ADDR -ART post head DEM.ADDR 1SG =SIM RED:carve 

ane. 

DEM.ADDR.A 

'So that it is becoming even with the others up there on that cross (beam). This is 
the head post, you know, that one I am cutting’ HouseBuilding08_JJ.002-003 


This shows that DEM.ADDR forms are used where the addressee is assumed to be aware 
of or attending to the entity in question, for instance looking straight at it as in this 
example. The DEM.SPKR forms are, on the other hand, restricted to uses where only the 
speaker has perceptual access to or is aware of the entity in question, as was illustrated 
by example (15) above. The relevant functional aspect of DEM.SPKR forms, distinguishing 
them from DEM.ADDR forms, is that they are exclusive to the speaker's access. 

The hypothesis that DEM.sPKR forms are associated with a strictly speaker-delimited 
here-space receives further support from the observation that DEM.sPKR forms cannot be 
used as expansions of the speaker's here-space to include that of the addressee, or a wider 
spatial dimension. For such contexts — where English would use here — DEM.ADDR forms 
are used. In the following example, taken from an account of local history recounting the 
resettlement of Vera'a speakers at the location of the contemporary village, the speaker 
refers to the village of Vera'a, where the recording is taking place: 


(25 le =n  masogidir -ém mul kal ma kenë 
LOC =ART time 3PL =TAM1 move up hither LOC.DEM.ADDR 


"When they came up here [to today's Vera'a village, where the recording takes 
place) BSVH.066 


The same is illustrated by the following examples from direct speech within narratives: 
in (26) and (27), the speaker refers to the house where he, his associates and the addressee 
are. 


(26) kamame mi-mi'ir ros a vie kamam mi'ir wal 
1PL.EX NEG.GEN1 RED-sleep NEG.GEN2 LOC.SP where 1PL.EX:TAM1 sleep INTENS 
sa  kene 
EMPH LOC.DEM.ADDR 
"We didn't sleep anywhere (else), we slept right here!’ JJQ.396 
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(27) kamam ne gitag én mo-mam [...] kamam ga mi'ir wal sa 
1PL.EX ?? not.exist ART POSS.HOUSE-1PL.EX IPLEX STAT sleep INTENS EMPH 
lé =n nimé ane 
LOC =ART house DEM.SPKR.A 
"We don’t have another house that we could move to. We slept in this house right 


here!’ JJQ.425 


The respective DEM.SPKR forms of locative adverbs are used only where the addressee 
is clearly not included in the speaker’s here-space, as in the following example of direct 
speech where a character is inviting a group of people from far down the coast to stay 
at his place: 


(28) van ma gēdē me ‘og kegen. 
go hither 1PLIN FUT stay LOC.DEM.SPKR 


‘(You guys) come here, and we will stay here together: JJO.273 


That attention focus is the driving force behind the choice between pEM.ADDR and 
DEM.SPKR forms is further supported by examples where DEM.sPKR forms are used with 
referents that are clearly situated at an enormous distance from the position of the 
speaker. The crucial point is that it is not in the addressee's here-space. Moreover, it seems 
that the speaker in this example intends to draw the addressee's attention to something 
new, or something the speaker would assume the addressee not to be aware of at a given 
point. The most blatant example illustrating this point comes from another instance of 
direct speech in a narrative: in (29), a boy asks his father about a light that is far away 
in the bush (and the video shows the narrator pointing into the distance when depict- 
ing the boy's gesture, while looking to the other side to address his father), using the 
DEM.SPKR form: 


(29) le =n qon ne  vowalWowot ne vesire — 'ama-gi so mam 
LOC =ART night NUM one  pers.name AOR:3SG ask PERS father-3sc QuoT dad 
ba =n bur agene -n  sava 
but =ART light DEM.SPKR.A =ART what 


"Ihen one day, Wowot would ask his father: “Dad, but this light here [pointing 
gesture], what is that?" ISWM.075-076 


The explanation for the use of a DEM.sPKR form in line with our hypothesis is that here 
the speaker has the light in mind and now wants to direct his father's attention to it so he 
can ask him about it. Similarly in the following example, the group of people - already 
featured in examples (26) - (28) - is looking for a place to sleep to hide from the evil 
spirit who had invited them to stay with him. The group has been sleeping in different 
parts of the house, hiding from the spirit who wants to kill them. In example (30), one 
member of the group gives them away to the spirit by telling him the actual place they 
spent the night in: 
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(30) kamam mi'ir sa — gene le =n wio 
1PL.EX sleep EMPH DEM.SPKR LOC =ART bamboo 


“Where did you sleep?” And then Tagarqonqon said:] “We slept right here, in 
the bamboos [We slept in the (bamboo) struts.]”’ JJQ.354 


This example contrasts with examples (26) and (27) - taken from the same context. Here 
the speaker draws the addressee's attention to something new, rather than the house, 
that the addressee is familiar with, aware of and shares his here-space with. Specifically, 
the speaker here points to a more specific place within the house that — given the context 
- the addressee was not aware of. Finally, an example from the house building video illus- 
trates the same idea. In (31), the speaker - standing next to the camera and commenting 
on what is happening in the frame - points to different spots where the side posts of the 
house are going to be located, thereby drawing the attention of the addressee - me in 
this case, behind the camera - to these different locations: 


(31) roue ti so =n  'avalLgi a kel sar gene dir -ek mom 
close DEL CPL =ART side-3sG Loc.sP back bushwards DEM.SPKR 3PL =AOR put 
qe =n  'ereqa'aga (0.4) wo dir -ek qe'eg kel en ‘aval-gi 
finish -ART PL post and 3PL -AOR begin back ART side-3sG 
agene. (0.3) 
DEM.SPKR.A 
‘It’s almost like that side further bushwards, they have put all the posts, and then 
they will start on this side over here: HouseBuilding05 1S.008-009 


Here, the speaker first points away from the two of us towards the bushwards side of the 
plane of the house, using a DEM.SPKR form, and then turn to the other side, again using a 
DEM.SPKR form. This consecutive pointing to new locations and referents seems similar 
to the listing use of proximal demonstratives in Samoan (Mosel 2004), and is compatible 
with the idea that such forms are used to draw the addressee's attention to the next, new 
point in the series. 

Finally, DEM.DIST forms are used where an entity is not associated with the speaker or 
addressee, and is not to become the focus of attention. Example (25) above illustrates this: 
the speaker uses a DEM.ADDR form when referring to the post he is cutting, assuming that 
the addressee is also attending to it. But in the immediately preceding clause, he refers 
to the cross beam of the house's scaffold that is outside of what speaker and addressee 
will then be attending to, and relevant only as background. The pEM.DIST form is used 
here. DEM.DIST forms are also used where the speaker cannot fully attend to the referent 
in question because it is outside their perception. This is the case in (32), where Cat is 
asking Rat about the pawpaw that only Rat has access to: 


(32) o bul  rov-rove nike kur kkirmo-n gako wova'al e 
INTERJ friend RED-close 2sG 2sG:AOR gnaw break =ART stalk pawpaw DEM.DIST 


‘Hey friend, will you almost have gnawed through that pawpaw stalk?’ 
GABG.083 
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I conclude that the DEM.ADDR and the DEM.SPKR forms are both used where the speaker 
seeks joint attention focus with the addressee; DEM.DIST forms, on the other hand, are 
used where the referent is outside of the interlocutor's attention sphere, and relevant 
only generally but not right here, or simply impossible to attend to conjointly. Obviously, 
this can be the case where the referent is at a greater distance from both speaker and 
hearer, hence its characterisation as distal. 

Lastly, DEM.SPKR forms are often used in two further contexts that belong to narrative 
texts, but are to be classified as exophoric-situational since they involve consideration of 
the communicative situation of the narration itself. The first of these are cases of Deixis 
am Phantasma (Bühler 1999[1934]) where a narrator uses the physical environment and/ 
or their own body to illustrate aspects of the depicted reality of the narrative (see also 
Himmelmann 1996: 224). Thus, in (33) the speaker shows where a character is holding 
the different kinds of leaves (to signal his peaceful mindset). In (34), the narrator demon- 
strates how a piece of breadfruit is broken off. For this latter example, two interpretations 
seem possible: one in terms of the manner of breaking it off, and one in terms of the di- 
mensions (size, shape) of the piece. In (35), the speaker illustrates the dimensions of a 
rock wall. 


(33) di ga gis en du mel val-gi agené wo =n seg 
3SG STAT hold ART leaf tree.sp opposite-3sG DEM.SPKR.A and =ART tree.sp 
val-gi agene. 
opposite-3SG DEM.SPKR.A 


‘He was holding a mel leaf in this hand here, and a seg leaf on the other side here’ 


ISWM.258 
(34) ‘ubu-gi =m 'uq rak den ën menre-gi ne vowal 

grandparent/child-3sG =TAM1 break.off out ABL ART piece-3SG NUM one 

segene [s] 

MAN.DEM.SPKR 

‘His grandma broke off a piece (of breadfruit) like this: MVB.149 
(35) kel qa-qan luwo segen. 

big RED-walling big | MAN.DEM.SPKR 

‘a wall big like this: ISWM.175 


These uses of DEM.sPKR forms are compatible with the hypothesis outlined above that 
these forms function to establish joint attention. The respective narrator is pointing out 
aspects of his body and corresponding aspects of the narrated reality that he believes 
the audience is not aware of. Similarly, and occasionally hard to distinguish from Deixis 
am Phantasma, narrative frames and meta-comments are essentially exophoric in nature. 
Narrative frames are those passages where a narrative is introduced or ended, or where 
the narrator makes meta-narrative comments, for example stating that the narrative will 
soon come to an end. As for the former case, speakers often start a narrative by referring 
to the act of narrating or the narrative (as a textual whole) that is going to follow. In these 
cases, DEM.SPKR forms are used, as can be seen from the following two examples: 
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(36) no =k kaka =n  nelno vu' agene 

1SG —-AOR:1SG Story =ART voice- spirit DEM.SPKR.A 

‘Iam going to tell the following story: ISWM.001 
(37) no ga  morosno -k dedicatem én kaka agene më =n e 


1SG STAT want 1SG =AOR:1SG dedicate ART story DEM.SPKR.A DAT =ART PERS 
raga mermere 

HUM:PL kid 

‘I want to dedicate this story to the children [...]’ MVBW.004 


A similar context featuring DEM.SPKR use is the following, where the narrator an- 
nounces a song to follow, sung in this case by another person, and meant to be sung 
by a character in the story: 


(38) lé =n nes ‘a bene di =m van esegēn 
LOC =ART song ASS.SP OBL.PRO 3SG =TAM1 gO DEM.SPKR.A 


"Ihe song belonging to it De the story] goes like this: HHAK.091 


Hence, we are dealing here with cataphoric uses of the speaker-oriented forms, which 
are notionally similar to this-new (Wald 1983: 93) uses in English, and which square with 
the attendance-establishing hypothesis: the function of DEM.sPKR forms here is to get the 
audience to attend to the story to come, which of course they are not familiar with, and 
which metaphorically rests with the speaker up to that point where they start the nar- 
ration. As observed by Himmelmann (1996) for a range of languages, these instances of 
cataphoric reference with speaker-based forms to introduce new information are often 
restricted to discourse-deictic use, and this is true for Vera'a as well. Hence, DEM.SPKR 
forms are used cataphorically in this way only where an event or content noun is in- 
volved, as in examples (36-38). 

Conversely, at the end of a story, narrators conclude the narration with reference 
back to the story, stating that it has come to an end. Interestingly, in these instances no 
demonstrative is ever used to modify the noun ‘story’. However, a locative adverb of the 
DEM.ADDR series is used to refer to that end point in the story, for example: 


(39) kaka wunva di =m din  wuvaekene. 

story probably 3sc =TAM1 reach only LOC.DEM.ADDR 

"Ihe story, it is probably over at this point: ISWM.361 
(40) ba =n kaka =m qe sa  kene. 

but -ART story =TAM1 finish EMPH LOC.DEM.ADDR 


"TL and what happened to them I don't know], but the story is over right here’ 
MVBW.137 


This context is not clearly exophoric, but can be seen as an instance of endophoric and 
discourse-deictic use. Two observations seem relevant here. For one thing, these exam- 
ples highlight that DEM.ADDR forms are not markers of definiteness in endophoric use, 
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since the NPs ‘story’ are clearly to be understood as definite in both examples. For an- 
other thing, the use of DEM.ADDR locative adverbs suggests a spatial metaphor for dis- 
course, where narrators frequently make meta-comments by referring to the point of 
the narration where ‘we are’ at a specific point (cf. Diessel 2006: 475). That DEM.ADDR is 
used is motivated by the speaker's assumption that the addressee is conjointly attending 
to the narration, and to that moment in it. 

Finally, there are examples where a story is referred back to by a plain pEM.ADDR form, 
which seems to be most similar to endophoric discourse-deictic use: 


(41) sone'e =n nii kaka wo =n nes ([...] 

?? DEM.ADDR =ART small story and =ART song 

‘So this is a little story and song [about how the incubator bird builds its nest]: 
GATG.065 


The DEM.ADDR form is used in these instances because the speaker assumes that the 
addressee is familiar with the referent and attending to it. 

In sum then, Vera'a has a three-way system of demonstratives. The choice of forms 
is driven primarily by considerations of attention focus and information management: 
DEM.DIST forms are used where an entity is outside the here-space of both speaker and 
addressee, and joint attention to it is not relevant or cannot be obtained. DEM.sPKR forms 
are used where an entity is in the here-space of the speaker, excluding the addressee. It is 
used where the speaker seeks to draw attention to an entity that only they are currently 
familiar with or aware of. DEM.ADDR forms are used where speaker and addressee share a 
here-space with regard to the entity in question, and the speaker assumes the addressee 
to already be attending to the entity in question. The crucial aspect of this system is 
that it has a marked speaker-oriented here-space and attention focus, and that wherever 
the addressee is seen as sharing here-space and attention, the DEM.ADDR form is used. 
The functions of DEM.SPKR and DEM.ADDR forms are essentially different in that only 
the former alters the focus of attention. The function of DEM.ADDR is to maintain joint 
attention focus on an entity for what follows. This can then yield different pragmatic 
effects, as can be seen from the following examples (42) and (43): 


(42) ei nike ané sa =s dada kel én nak mu-k 
INTERJ 2SG DEM.ADDR.A EMPH =SIM RED:do back ART canoe POSS.GEN-1SG 


"Hey, so is it you who has been putting my canoe back up all the time?’ JJQ.064 


(43) “ei kamadu ane =m  wanma sir nik ane. 

INTERJIDLEX | DEM.ADDR.A -TAM1go hither for 25G DEM.ADDR.A 

‘Hey, we came here because of you, [because we want to be together with you]: 
1.PALA.059—-061 


In (42), the DEM.ADDR-marked pronoun occurs in left-dislocated position, which is in 
other cases also associated with topicalisation. Here, however, it is clear that the refer- 
ent has a role more akin to what has been called focus (Krifka 2008, Lambrecht 1994), 
picking out a referent from a putative open set (someone who can do things). Likewise, 
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the use of a DEM.ADDR form with the pronouns in (43) would be hard to motivate by con- 
siderations of identification, or topic marking. If anything, it seems, the subject pronoun 
would have to be construed as a ‘topic’, whereas the PP complement would be in ‘focus’ 
— ‘we are not here just for the dancing or other people, but for you’. These examples thus 
underscore the point that demonstratives do not serve as markers of definiteness, since 
the personal pronouns already entail definiteness. Instead of definiteness, demonstra- 
tives seem to bear functions in the domain of information structure. However, it is also 
clear from these examples that they do not mark specific information-structural roles 
like ‘topic’ or ‘focus’. Rather, in my view, the use of DEM.ADDR forms is motivated here 
by the same considerations of maintained attention focus as in other uses discussed thus 
far, and this has the effect of an inference that what follows is in some way relevant for 
the entities thus expressed. Given other structural properties, like left-dislocation and 
emphatic marking, this may produce a 'focus effect' (Matié & Wedgwood 2013). More- 
over, it does seem to have the effect of singling out entities as “information packages' 
(Ozerov 2015), the interpretation of which depends on the discourse context. I will take 
up this point in 84 and $5. 


3.4 Endophoric use of Vera'a demonstratives 


I now turn to endophoric uses of demonstratives within narrative texts, excluding its 
frame, meta-comments, and direct speech. I call this instance of discourse ‘narration text’. 
The functions of demonstratives in this context are typically associated with anaphora 
and referent tracking (called “tracking use' in Himmelmann 1996), but also with consid- 
erations of information packaging (Diessel 1999). In what follows I will go through the 
uses of demonstratives in different syntactic contexts. It should be noted at this point 
that I will be dealing almost exclusively with the use of DEM.ADDR and DEM.DIST forms 
since DEM.SPKR forms are very restricted in this context, occurring practically only in 
one interesting case which will be discussed first. It will be made clear in $4 below that 
the virtual absence of DEM.SPKR forms in narration text follows straightforwardly from 
their attention-establishing cataphoric function discussed in 83.3. 


3.4.1 Speaker-oriented forms in narrative discourse 


The only example of a DEM.SPKR form in narration text is the following, where the nar- 
rator seems to slip into the report of a character's thought: 


(44) di ne vanma ne rere e ruwa agen [7 di ros. 
3SG AOR:3sSG go hither AoR:3sc head PERS HUM:DL DEM.SPKR.A INTERJ 3SG NEG2 
“Then she would come over and take a closer look at “these two here, no, not 


him" ISWM.326 


This appears to be one of many examples from Vera'a narratives where a narrator seems 
to slide seamlessly from the narration text into direct speech or thought. As in this ex- 
ample, such cases often feature interjections. Thus, in this example direct thought and 
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narration text are not clearly marked off. The use of a DEM.SPKR form here in fact al- 
ready represents an instance of Deixis am Phantasma, and it is motivated by the same 
attention-directing principles discussed above. Hence, for the remainder of this section, 
I will be dealing only with DEM.ADDR and DEM.DIST forms. 


3.4.2 Plain addressee-oriented forms on clause level in narration text 


Plain, i.e. unprefixed, DEM.ADDR forms are very rare in narration text. One recurrent 
occurrence is their pronominal use in subject function. Their function here is discourse- 
deictic, as in the following example (45): 


(45) ne =n  tektek mu-n e | Qo. 
DEM.ADDR -ART speech POSS.GEN-CS PERS PERS.NAME 
‘That was Oo talking there’ JJQ.308 


The DEM.ADDR form here refers back to the last few intonation units in which the narrator 
clarifies that it was the hero of the story, Qo', who said these things. As in exophoric use, 
a DEM.ADDR form can occur in clause-final position in narration text. Here, however, they 
seem to have a somewhat different function in maintaining the addressee's attention to 
a specific point in a narrative: 


(46) di =m ‘dg kelkel oe kelkel n= mei ne onon row 
3SG =TAM1 stay RED:back stay RED:back ART= reef AOR:3SG RED: lie at.sea 
ne' di =m "og kelkel [...] vavavavan. 

DEM.ADDR 3$G =TAM1 stay RED:back RED-go 


"He was staying up (on the island), and the reef was lying down at the sea. So 
now he [the man] was spending his time (there) [and one night he was dreaming, 
the reef came to him in his dream]. ' ISAM.048 


Such instances of DEM.ADDR forms are relatively rare in the corpus, and more data needs 
to be analysed before more substantial conclusions can be drawn. Nonetheless, I would 
like to argue that the use ofthe DEM.ADDR form here has the same function of maintaining 
joint attention to a specific point in a narration. This then has two related effects in this 
context, I believe. For one thing, it yields a simultaneity effect: while the reef is sitting 
down at the shore the man is up at the top of the shore. It also seems to have the effect of 
keeping the reef relevant for the following propositions, leading over to its appearance in 
the man's dream. Although these interpretations would have to be further substantiated, 
they do seem to square with the information-packaging effects to be discussed in 84. 


3.4.3 Adnominal use of addressee-oriented versus distal demonstratives in 
narration text 


DEM.ADDR forms occur adnominally in cases where the referent in question has been 
mentioned in the immediate context. A typical usage context is where the referent was 
introduced in the previous clause and is then taken up immediately again, for example: 
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(47 duru=k ‘én ma =n lumgav ne vowallumgav | ne vowal 
3DL -AOR see hither -ART young.man NUM one young.man NUM one 
ane di ne laa-laka | sene wo ne virig ma [...] 
DEM.ADDR.A 3SG AOR:3SG RED-dance MAN.DEM.ADDR and AOR:35G rush hither 


"Ihen the two spotted a young man. And the young man, he danced, and then 
(he) came over, [and sat down under the wild kava plant]? 1.PALA.044-045 


(48) van ma din en bē bē ne vowal ané =n  so-gi -n 
go hither reach ART water water NUM one ` DEM.ADDR.A =ART name-3SG =ART 
Be'elmamgin. 

PLACE.NAME 


"Went here and reached a river. This (some) river, its name (is) Be'elmamgin [lit. 
‘Cold to dive water]: AS.1.071-073 


(49) ote mu-n e  nii- rene ane ne ma e  duru-k 
mum POSS.GEN-CS PERS small- female DEM.ADDR.A AOR:3SG dead DISC 3DL =AOR 
en ..eh -n — 'ama-gi ne ‘én gor | en ni'i- rene ane 

see COR =ART father-385G AOR:35G see secure ART small- female DEM.ADDR.A 

vaavan'e =n ‘ama-gi ane ne ma’. 

RED-go DISC =ART father-38SG DEM.ADDR.A AOR:3SG dead 


‘Then the mother of the little girl died. So her father looked after the little girl, on 
and on, and then her father died? ANV.005-007 


These examples show again that DEM.ADDR forms do not merely mark an NP as defi- 
nite, and this is also clear from its compatibility with personal pronouns, see $3.3 above. 
Moreover, the total of 361 adnominal uses of demonstrative forms (including all adverbs) 
accounts for only a small fraction of full (lexical) NPs with given referents in our corpus, 
hence givenness/identifiability is likewise not marked by demonstratives. This was also 
made clear with respect to reference back to the story in narrative frames. The point 
is further supported by examples where DEM.ADDR is used with personal names or, as 
in the following example (50). Typically, these are nouns meaning 'older same-sex sib- 
ling, firstborn’ or ‘younger same-sex sibling, lastborn’, with couples of siblings typically 
featuring as heroes in Vera'a literature. 


(50) wo 'isimere ane ne op vee =n dudu vada ne 
and firstborn DEM.ADDR.A AOR:35G hand release =ART leaf pandanus NUM 
vowal. 
one 


“(The devil was then sucking in the sea again, and the sea current [that arose 
from that] brought the canoe [the two siblings are sitting in] ever closer to him.) 
So then the older sister let go of a pandanus leaf? 2.PALA.091 


This example also underscores that DEM.ADDR forms are not merely used to differentiate 
between referents, since this function can be clearly ascribed to the use of the noun itself. 
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In contrast to DEM.ADDR forms, DEM.DIST forms are used adnominally where the referent 
of the NP has been mentioned a while ago in the previous discourse, for example: 


(51) duru=k wos en ñar wos en nar ané 
3DL -AOR hammer ART canarium.nut hammer ART canarium.nut DEM.ADDR.A 
dirol da =n wig me big'o sa =n  roro' e ae duru 
3TL do =ART wig FUT eat carry EMPH =ART k.o.cabbage DEM.DIST INTERJ 3DL 
=k wos en ñar. 
=AOR hammer ART canarium.nut 


'So they would smash up the canarium nuts. Smash up the canarium nuts to 


make wig (a kind of nalot (Bislama), i.e. K.o. pudding), which was to be had with 
that cabbage (that they had gotten earlier on). All right, so they smashed the 


canarium nuts (and then started making nalot)’ AS.1.028-030 
(52) wededi =m luwodi ne l =n nak susu e. 

rain 3SG =TAM1 big 3SG AOR:3SG take =ART canoe RED-paddle DEM.DIST 

‘Then the rain became big, and it took out that canoe’ JJQ.104-105 


A DEM.DIST form can also be used where the referent is merely evoked by or inferable 
from earlier happenings, as is the case in the following example: 


(53) ba di =m gis en menre-kolov- 'ar-'ara e ne mul 'o' 
but 3sc =TAM1 hold ART piece chipping RED-chop DEM.DIST AOR:3SG go carry 
le =n  lolo- (me 
LOC =ART iniside- house 
‘But he took a piece of that chipping, and took it home with himself? JJQ.052 


Here, the actual chopping of the tree was mentioned seven intonation units away, and 
the DEM.DIST form is used because it requires the addressee to re-activate this event to 
relate the ‘piece of chipping’ to it. But apparently its inferability via frame semantics is 
sufficient to warrant the use of a DEM.DIST form that is otherwise used for individually 
given referents. Though there is good reason to assume that the DEM.DIST form in the 
previous examples is an NP-internal constituent, its position is nonetheless ambiguous 
between NP- and clause-final, the latter being another possible slot for both DEM.ADDR 
and DEM.DIST forms (see below). For the sake of completeness, the following example 
shows a DEM.DIST form in a subject NP. This is likewise an instance of inference, since 
the dancing was mentioned ten intonation units before inside a character's direct speech 
who states that they want to dance: 


(54) laklaka e -m van. 

dance DEM.DIST =TAM1 go 

‘And then that dance happened: ISWM.319 
In this example, the narrator is referring back to the idea of holding a dance for the hero 
of the story, who is temporarily dead. After this was mentioned first by some of the 


characters ten intonation units earlier, it was reported how people were walking to the 
place where the dead would have to be found and the dances would take place. 
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3.4.4 Addressee-oriented and distal demonstratives in clause combining 
constructions 


The prefixed basic DEM.ADDR form can occur in clause-final position where it is used 
in clause combining constructions. In the following examples, it occurs in clause-final 
position and the event described therein comes to an end once the second one sets in: 


(55) dir =m | suo kalwal din sar ma ane e | Qo 
3PL =TAM1 paddle up once reach inland hither DEM.ADDR.A PERS PERS.NAME 
ne lañ wor wal en nak. 
AOR:38G slap split once ART canoe 


"And as they had just come up on the shore, Qo' smashed the canoe to pieces: 


JJQ.257 


(56) ale duru=m inin va-van ane e Dol so o 
INTERJ 3DL -TAMI RED:drink RED-go DEM.ADDR.A PERS PERS.NAME QUOT INTERJ 
no man qe. 
1sG PFV finish 
“All right, so the two of them drank and drank until [up to this point, and then...] 
Dol said: “I am done. (The kava has kicked in.)” 1.AS.039-040 


In other instances, the two events are depicted as taking place simultaneously: 


(57 lē =n  masogidi =m sagsag ane ba di ne susur lik 
LOC =ART time 3SG =TAM1 RED:sit DEM.ADDR.A but 3SG AOR:3SG RED:tie more 
en aklé dudu woge enge, 

ART some- leaf tree 


"And as he was sitting (up at the top of the shore), he also assembled some leaves 
(to make some decoration for himself): ISAM.076 


(58) ba dirol-s vanvan ane -n  'isimere mal lama'iso =n ‘ama’. 
but 3TL =SIM RED:gO DEM.ADDR.A =ART lastborn REM.PFV know CPL =ART devil 


“But as the three were walking along, the younger sister had already realised that 
is was a devil’ 2.PALA.018-019 


These examples of DEM.ADDR uses are similar to the use of the plain form in (46). In yet 
other cases, the first clause marked by a clause-final DEM.ADDR form expresses a state of 
affairs that is a presupposition for the proposition of the following clause, expressing for 
example the reason for that proposition holding, as in the following example: 


(59) so Oo di ne sag'i ane [...] di e lama'iros so di 
?? PERS.NAME 3SG AOR:3SG sit DEL DEM.ADDR.A 3SG NEG1 know NEG2 CPL 3SG 
me 'isiw sivie. 


FUT climb.down how 


“(And then they turned towards Gaua). But Qo', he was sitting up there, [that 
darag towla it is so big so] he didn't know how to climb down: JJQ.171-172 
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Generally speaking, clause-final DEM.ADDR forms seem to have the effect of interpreting 
the preceding clause as immediately relevant for the interpretation of the subsequent 
clause. By contrast, clause-final DEM.DIST forms have the effect of steering away from 
the proposition expressed in its clause, while keeping it on hold for future reference. The 
following example (60) is taken from the same story as (59), and occurs long before that 
latter. The nEM.DisT-marked clause expresses the same state as the DEM.ADDR-marked 
one in (59); but here, the subsequent clause is not immediately related to that proposition, 
but instead shows a shift in topic and theme, now dealing with Qo"s brothers and their 
moving away from him. The story then goes on to recall how they steal his wife and 
canoe and travel to Gaua. Qo’’s situation is then taken up again in the clause preceding 
the one in (59). 


(60 e Qo' ne sagren sag kēkē e sap wo e raga 
PERS PERS.NAME AOR:3SG sit stuck up above DEM.DIST sit and PERS HUM:PL 
'i-'isi-gi ane vanrow le =n  vunuo. 
RED-younger.same.sex.sibling-3sG DEM.ADDR.A go down Loc =ART village 
‘So Qo' was now stuck up there (in the tree). Sat (there) while his brothers went 
down to the village. (They stole his wife and his canoe, and paddled off with it.) 
JJQ.151-152 


Noteworthy in this example is also the switch of topic in the second clause and the con- 
comitant use of an adnominal DEM.ADDR form. Another illustrative example involving a 
clause-final DEM.DIST form shows its theme-switching effect: in (61), the context is that 
two men find a turtle at the beach and tie it up to pick it up later. After these recount- 
ings, the two men walk away from the beach and up the shore to their home village. The 
following discourse is about the men's reporting back to their chief and the villager's 
planning to go down, cook the turtle, and hold a feast. The clause-final DEM.DIST forms 
in this instance have the effect of signalling this momentary switch in scene before the 
narration turns back to this referent later: 


(61) ge'ew'i ne wil ‘alan en nud e gë duru =k 
other AoR:3sG turn.over upside.down ART turtle DEM.DIST finish 3DL -AOR 
vrig kalkel le =n lolo vunu 
rush up back roc =ART inside village 


"And then the other one turned the turtle over onto its back, and after that the 
two ran back up into the village: GAQG.057-058 


Regarding this particular example, it should be noted again that the position and function 
of the DEM.DIST form is ambiguous here. The discussion of other examples of adnominal 
DEM.DIST forms above would suggest that its use in (61) is not to be classified as adnom- 
inal, since the turtle had just been the theme of the preceding discourse, and it seems 
more plausible to interpret it as clause-final, bringing about the effect outlined here. This 
conclusion should, however, be taken as preliminary, and further examples are required 
to substantiate — or falsify - it. 
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4 Discussion: Attention and information 


I first summarise the findings from the discussion of exophoric and endophoric uses, and 
then present a preliminary analysis of these. In exophoric use, DEM.SPKR forms are used 
where only the speaker is familiar with or aware of an entity, and attends to it. The form 
is used in order to draw the addressee's attention to the same entity, thereby establish- 
ing a joint attention focus. Relevant contexts exemplified above are those where only 
the speaker can see (or otherwise perceive) the entity; where the speaker assumes that 
the addressee is not familiar with an entity, for instance the story in introductions to 
narratives; where the speaker assumes the addressee to be unaware of the entity to be 
talked about, for instance the piece of wood brought to a house builder, the enacting of 
bodily aspects in Deixis am Phantasma, or the light seen at night in the bush. The last 
example makes it clear that spatial considerations are not the relevant ones: what counts 
instead is the awareness-state of the speaker, and their assumptions concerning that of 
the addressee. Obviously, in communication the speaker will always be aware of the 
entities they verbalise and elaborate upon, thus speaker-oriented forms are naturally 
used when the addressee is assumed to lack that awareness or knowledge (see Mosel 
2004 for the same point on Samoan proximal demonstratives). The DEM.SPKR forms are, 
however, never used cataphorically for new referents in a narrative, and it seems pos- 
sible that this kind of use is typologically very restricted (English new-this), as already 
remarked by Himmelmann (1996), a tendency corroborated by the case of Vera'a. As for 
locative adverbs, DEM.SPKR forms are used where the speaker perceives their location as 
not overlapping with that of the addressee. 

The addressee-oriented forms, DEM.ADDR, are used where the speaker assumes the 
addressee to already be aware of and attending to an entity. With the plain forms in 
pronominal function within exophoric use, this concerns mostly immediately perceived 
sensations, like sounds. In endophoric use, these pronominal instances of DEM.ADDR 
forms have discourse-deictic functions, referring back to the immediately preceding dis- 
course. In adnominal use, the forms are used where the entity in question is already 
being attended to by the addressee. One effect of its use is that the referent has to be 
understood as individuated, whereas an unmarked NP can always have class reference; 
and given that Vera'a does not distinguish generic from specific contexts in other parts 
of the grammar, for instance tense, aspect, mood-marking, this is an omnipresent poten- 
tial ambiguity. In narrative texts, a DEM.ADDR form is used adnominally in NPs whose 
referents are identifiable from the immediately preceding discourse context. This can be 
seen as equivalent to established attention to an entity in exophoric contexts. Their use 
does not mark the NP as having an identifiable referent, since most other NPs with such 
identifiable referents are unmarked, and DEM.ADDR forms also occur with pronouns and 
proper names. My hypothesis is that their use is connected to considerations of infor- 
mation packaging (see below). Finally, DEM.ADDR forms are used in clause-combining 
constructions where the clause they mark is immediately relevant for the interpretation 
of the subsequent clause. 
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Distal forms are used in exophoric contexts where an entity is outside of the speaker's 
and the addressee's interaction space, and also not within speaker's here-space. This was 
illustrated with an example from the house building video where the speaker refers to the 
cross beam ofthe scaffold, but obviously then wants the addressee to focus their attention 
on the wood in his hands. In endophoric use, a DEM.DIST form is used adnominally where 
a referent is being re-activated, and is used clause-finally at thematic or episodic shifts. 

Comparison of DEM.ADDR and DEM.DIST forms reveals that the latter are used merely 
to activate information, but not to focus on it, whereas the former have the effect of 
demanding focus of attention. This was demonstrated in example (4), repeated here as 
(62): 


(62) ne ene ruwa | malmala ë =s sagi e ruwa  malmala 
AOR:35G see PERS HUM:DL girl DEM.DIST =SIM sit DEL PERS HUM:DL girl 
ane so 


DEM.ADDR.A QUOT 


‘[...] and then (he) spotted those two girls (who were) sitting (there), and the two 
girls said: ..’ 1.PALA.059-060 


The use of a DEM.DIST form in the first mention of ‘the girls’ functions to re-activate 
the referent who had been mentioned before, but the few intonation units immediately 
before this dealt exclusively with the young man’s dancing. In the second clause, a 
DEM.ADDR form is used, and here seems to have a topicalising effect, signalling that the 
clause is about them. This suggests furthermore that information packaging and refer- 
ent activation are two different aspects of discourse structure, as illustrated in Vallduvi 
(1993: Chapter 2) , although this is often blurred in a Givón'ian sense of ‘topic’ that seems 
to conflate notions dealt with under the headings ‘discourse referent’, ‘discourse topic’, 
and ‘(sentence) topic’ in other work. 

But does a DEM.ADDR form as such have a topicalising function, or even ‘topic-marking’ 
function, as is sometimes claimed for Oceanic languages? At least in Vera'a, the answer 
should be no. We have seen in examples above that DEM.ADDR forms are in fact found to 
mark pronouns that are clearly in a relation one would traditionally call ‘focus’, namely 
where the pronoun is left-dislocated and marked with the emphatic marker sa. As out- 
lined above, I suggest here that DEM.ADDR forms are not ‘polyfunctional’ in any way, 
sometimes ‘marking or expressing topics’, and in other contexts ‘marking or expressing 
focus’. Instead, I analyse these uses in the spirit of Matić & Wedgwood (2013), Ozerov 
(2014) and Ozerov (2015) as bringing about an effect that is compatible with both ‘topic’ 
and ‘focus’, as discussed in the literature (Lambrecht 1994, Krifka 2008). In this way, a 
topicalising effect is merely brought about by the DEM.ADDR form meaning ‘addressee 
keep attention focused on this’, which can trigger the inference that something relevant 
will be said about the entity in question. Likewise, keeping attention focus can have the 
effect of stressing someone's involvement in a state-of-affairs, as in the examples where 
someone is asked whether he was the one who put a tree back up. 

The clause-combining uses of DEM.ADDR forms can be accounted for in a similar man- 
ner: namely, they have the effect of holding a proposition in attention focus. Given the 
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maintained attention to the proposition, it is interpreted as immediately relevant for the 
proposition to follow, so that the latter is interpreted as being about the former. The 
use of DEM.DIST forms, on the other hand, has the general function of activating a refer- 
ent, but not making it the attention focus. In clause-combining constructions, this bears 
the effect of topic and/or theme/episodic shift, together with the expectation that the 
previous episode may be relevant in some way at a later point in the discourse. 

Therefore, the use of demonstratives in exophoric and endophoric contexts follows 
straightforwardly from their meaning related to attention focus, as shown in Table 5. 
This meaning of demonstratives can be stated as relatively slim, much in the spirit of 
Enfield (2003). All readings summarised above are the result of these stable meanings 
interpreted by interlocutors in relevant contexts. 


Table 5: Meaning of demonstratives in Vera'a 


speaker-oriented you do not attend to this 
addressee-oriented I and you attend to this 
distal I do not attend to this 


5 Conclusions 


In conclusion, demonstratives in Vera'a bear attention-focusing meaning in the sense of 
Diessel (2006), and their interpretative effects result from interaction of speaker and ad- 
dressee in relevant communicative contexts, much in the spirit of Enfield (2003). Spatial 
considerations are not primary, that is to say that orientation with regard to speech- 
act participants is not spatial, but communicative-interactional, relating to dimensions 
of knowledge and attention coordination. The interactional interpretations triggered by 
the different meanings of demonstratives produce different effects in different contexts, 
which means that we do not need to assume polysemy for demonstratives. I conclude 
that their meaning is absolutely stable across exophoric and endophoric uses, and sub- 
uses therein. 

Finally, it should be noted again that more extensive examination of more video data 
- which has been collected in large amounts over the past ten years - is necessary to 
substantiate the extensional aspects of demonstratives in exophoric use, and this should 
be done in future research. Thus, for instance it would be interesting to see whether 
Vera'a speakers can use a speaker-oriented form to point to something unnoticed by the 
addressee on the latter's body or clothing, and the like. Possibly also the use of certain 
stimuli (like the MPI space games) may help clarify some usages. I do believe, however, 
that even these initial observations from the currently available data are sufficient to 
draw up the basic outline of the system as done here. Moreover, it should be noted that 
spatial considerations as observable in video data are not an objective heuristic to explain 
the use of demonstratives, as illustrated convincingly by Enfield's (2003) examination 
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of such data in Lao. It seems that Vera'a presents another illustrative example of the 
salience of interactional pragmatics considerations in accounting of multi-contextual 
uses of demonstratives. 
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Abbreviations 
1 1st person IN inclusive 
2 2nd person INTENS intensifier 
3 3rd person INTERJ interjection 
A a prefix LOC locative 
ABL ablative MAN manner 
ADDR addressee-oriented NEG.GEN general negation 
ADN adnominal NSG non-singular 
AOR aorist NUM numeral article 
ART common article OBL oblique 
ASS associative PERS personal article 
COR correction PFV perfective 
CPL complementizer PL plural 
cs construct suffix POSS.DOM domestic possession 
DAT dative POSS.GEN general possession 
DEL delimitative aktionsart POSS.HOUSE house possession 
DEM demonstrative PRO pronoun 
DISC discourse marker QUOT quotative 
DIST distal RED reduplication 
DL dual REM.PFV remote perfective 
EMPH emphatic SG singular 
EX exclusive SIM simultaneous 
FUT future SP specific 
GRAID Grammatical Relations SPKR speaker-oriented 
and Animacy in STAT stative 
Discourse TAM tense-aspect-mood 
HUM human TL trial 
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This chapter discusses referential strategies in Manado Malay (MM), a variety of trade Malay 
spoken in North Sulawesi, with special focus on how a lexical NP is marked according to 
the information status of the referent. Like some other Malay varieties, MM uses two strate- 
gies to indicate definiteness: articles and the third person singular possessive. The articles 
are derived from demonstratives and used for direct situational and anaphoric reference, 
while the possessive is used for reference in which some kind of association is required for 
identification. An article and a possessive may co-occur in one NP. The semantic domain 
each form covers is not exclusive to the other but rather belongs to intrinsically different 
semantic dimensions. Thus, the MM system enables speakers to mark that the referent is 
textual-situationally accessible and, at the same time, associable to the larger shared situa- 
tion. 


1 Introduction 


This paper discusses referential strategies employed in lexical NPs in Manado Malay 
(hereafter MM). There, forms functionally similar to what is called the “definite marker” 
in other languages are grammaticalizing from two distinct sources: one is from the third 
person singular possessive marker depe and the other is from the demonstratives. 

MM is a variety of trade Malay spoken in Indonesia by upwards of 2 million people 
in North Sulawesi, the Sangir and Talaud archipelagos to the north, and Gorontalo to 
the west. It seems to have developed from North Moluccan Malay, but it has developed 
independently since the 17th century (Paauw 2008: 43-44). Until relatively recently, first 
language speakers were mainly found in the city of Manado, while elsewhere MM was 
used as a second language by speakers of the indigenous Minahasan and Sangiric lan- 
guages. In recent decades MM has become the first language of virtually the entire pop- 
ulation of the region. Although most of the Minahasan and Sangiric languages are still 
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spoken, even elderly people grew up with MM and it could be considered a “joint” first 
language, while for many people of all ages, it is their first or even their only language. 

The notion of monolingual MM speakers requires some clarification. The education 
system, media, and government administration largely use standard Bahasa Indonesia 
(BI), and so everyone is exposed to this variety and code switching and mixing are perva- 
sive. Some speakers are clear about the significant grammatical and lexical differences 
between BI and MM, and they call MM “Melayu Manado” or “Bahasa Manado”, recogniz- 
ing that it is not the same as BI. Others do not have this meta-awareness and believe that 
the language that they speak is BI. As noted by Paauw, "Manado Malay and Indonesian 
(and, in particular, colloquial Indonesian) have been converging to the point that speak- 
ers of Manado Malay, to varying extents and often subconsciously, employ Indonesian 
vocabulary and constructions when using Manado Malay, and it is often difficult to draw 
a line between the two languages" (Paauw 2008: 44). 

The data sources of this study are (i) translation/elicitation from standard Indonesian 
sentences, (ii) semi-spontaneous monologue that was obtained using a procedural video 
as stimulus, and (iii) an unpublished MM-BI dictionary compiled by the Pusat Penerjema- 
han Bahasa (PPB, Translation Centre) in Tomohon. The last item was particularly useful 
and the authors would like to thank Albert Polii for making it available to us. 

The structure of this chapter is as follows: §2 provides a brief overview of the NP 
structure of MM. In §3, we will examine the semantic function of the two definite mark- 
ing devices, that is, articles and the third person possessive depe based on elicited and 
published data, and provide a brief comparison to the other Malay varieties. In §4, we 
will see larger texts elicited using a procedural video and confirm the usage of the two 
devices discussed in §3. In §5, we look at the MM definite marking strategy from a cross- 
linguistic perspective. 


2 NP structure in MM 


Before discussing the referential strategy of MM, we will show the NP structure in MM, 
largely based on Prentice (1994: 424—429). (1) is the structure that Prentice suggests. Note 
that Prentice calls the demonstrative *deictic". 


(1) (article) (PossEsson-*pe) Nheaq (attributive n/v)! 


Two articles tu and ni, “both translatable by the” (Prentice 1994: 424), are derived from the 
distal demonstrative itu and proximal demonstrative ini, respectively. “The articles both 
mark the referent of the following noun as being known to both speaker and addressee, 
while ni has the added function of indicating geographical temporal and/or psycholog- 
ical proximity to the speaker” (Prentice 1994: 424). Examples (2a-b) are examples from 
Prentice (1994: 424). 


1Quantifiers may precede or follow the head noun according to its pragmatic status, which we will not go 
into further in this research. 
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(2 a. tu anging 
ART wind 
‘the wind (e.g. which blew down my coconut palms.) 
b. ni anging 
ART wind 


‘the wind (e.g. which is blowing now.)’ 


Prentice suggests that the demonstratives may either precede the head-noun alone or 
follow the combination of article + noun, as shown in Example (3a-d). 


(3) ‘that island’ or ‘those islands’/ ‘this island’ or ‘these islands’ 


a. itu pulo 
that island 


b. ini pulo 
this island 


c. tu pulo itu 
ART island that 


d. ni pulo ini 
ART island this 


We assume that Prentice's data was collected in the 1980s and 1990s. More recent MM 
data shows that the pre-predicate slot is more frequently, though not exclusively, filled 
by the article. Thus, phrases like (2a-b) or (3c-d) are more common than ones like (3a-b). 

In more recent MM data, the form tu may co-occur with either the demonstrative itu 
or ini, as seen in tu ruma itu in example (4) and tu parkara ini ‘this problem’ in example 


(5). 


(4 Tu  ruma itu ancor lantaran da kena bom waktu prang. 
ART.D house that broken because pst affected bomb time war 


“That house is broken because it was bombed in the war. (PPB:2) 


(5 Tu  parkaraini so lama nyanda klar-klar. 
ART.Dissue  DEM.PPFVlong NEG solved 


“This issue has not been solved (lit. finished) for a long time (PPB:62) 


Example (4) and (5) suggest that the form tu has undergone semantic bleaching, as it is 
neutral regarding the distance to the reference point. 

The occurrence of the determiners tu and ni exhibits a syntactic restriction in that they 
only occur with S, A, and P but not with an oblique. Consider examples (6) and (7) below, 
which Prentice (1994: 430) provides to show word order variation in the MM transitive 
clause. Examples (6) and (7) both denote almost the same proposition in which “T” is the 
actor, the basket is the location, and the rice is the theme; and the non-agent NP occurs 
with the determiner tu only when it is P. 
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(6) Kitaso isi tu loto deng padi. 
1sG PFV fill AnT.D basket with rice 


‘T have already filled the basket with rice’ 


(7) Kitaso isi tu padidi loto. 
isc PFV fill ART.D rice at basket 
‘I have already filled the rice with a basket? 


In possessive structures, the possessor noun or pronoun precedes the head (the possessed 
item) being followed by the possessive marker pe, the short form of punya ‘have’ in 
standard Malay. Table 1 contains the paradigm of possessives with personal pronouns 
and a lexical noun. 


Table 1: Possessives 


1sG kita / ta pe kita pe anak ‘my child’ 

1PL torang/tong pe torang pe anak ‘our child’ 

2sG ngana pe ngana pe anak ‘your (sc.) child’ 
2PL ngoni pe ngoni pe anak “your (Pt.) child’ 
3sG2 dia pe / depe depe anak “his/ her/ its child’ 
3PL dorang / dong pe dorang pe anak ‘their child’ 
lexical noun NOUN pe kamar anak pe kamar ‘a child's room’ 


Among the forms presented in Table 1, the long form of the first-person possessive 
(kita pe) and the short form of the third person (depe) are not shown in Prentice (1994: 
424). However these forms, especially depe, are much more frequently observed in cur- 
rent MM than their alternatives. 

In MM, the possessor is obligatorily marked when the referent of the matrix NP is 
possessed by, or has a part-whole relation to, a referent whose identity is clear from the 
previous utterance - thus, in sentences (8) and (9), the possessive obligatorily occurs. 


(8) Tu anakpe gaga. depe mata basar deng depe mulu kacili. 
ART.D child Poss beautiful 3sG.Poss eyes big and 3sc.Poss mouth small 


"How beautiful the child is. Her eyes are big, and her eyes are big, and her mouth 
is small? (elicited) 


(9) Sayang ini pohon, depe ujung so potong. 
pity DEM.P tree  3sG.POsstip PFV cut 
‘(This) poor tree, the (its) top has been chopped off? (elicited) 


"The third person pronouns dia (sc) and dorang (PL) may refer to both animate and inanimate referents, and 
so may the possessives, as seen in sentence (8) and (9) among others. 
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The article and possessive may co-occur in pre-head noun position, as seen in examples 
(10) and (11), suggesting they are assigned to separate syntactic positions.? 


(10) Serta tu depe ubi milu deng sambiki so  mandidi. 
after ART.D 3SG.POSS potato corn and pumpkin Prv boil 


‘after the potato, corn and pumpkin are boiled’ (elicited narrative, speaker D: 45) 


OU Diano tu /ni_ kitape papa. 
3SG PTC ART.D / ART.P 1SG poss father 
He is my father. (lit. the my father) (PPB dictionary:89) 


This co-occurrence also suggests that they each have semantic functions independent of 
each other. We will return to this point in $4. 

As will be seen in the section that follows, the use of depe partially overlaps with that 
of English definite article the, but not all depe-marked NPs refer to a so-called definite 
referent. 

In example (12), neither the possessor ayang ‘chicken’ or de '3sc' in the possessive is 
referential, but used attributively.* 


(12) Kita suka ayang pe kaki, mar nyanda suka depe dada. 
1sG like chicken poss leg but NEG. like 3sG.Poss breast 


‘I like chicken leg meat, but not chicken breast meat: (elicited) 


The development of the articles and possessives that we have seen in this section have 
been observed in other varieties of Malay, to a lesser or greater extent. We will give a brief 
comparison in $3. The variation of the position of the demonstratives and the long and 
short forms of the third person possessives mentioned above illustrate the transitional 
status of the two strategies. 


3 Semantic functions of the articles and the possessive 
construction 
As mentioned in the introduction, MM has developed two types of definite markers, 


the sources of which are the demonstratives and possessives. Their compatibility in one 
NP (e.g. tu depe ruma 'the house of him/her/it) implies that each device has a function 


3As for the status of possessives, Lyons (1999: 130-134) proposed a typological distinction of DG language 
and AG language; in the former, the possessive is assigned to the determiner position and, in the latter, to 
the adjectival or some other position. The compatibility of the article and possessive, seen in sentences (10) 
and (11), suggests that MM belongs to the latter (AG) type. 

^Note that the antecedent of depe in example (12) is the expression ayang “chicken”, not the referent of 
the expression ayang ‘chicken’. (See Krifka & Musan 2012: 23 on the distinction of expression givenness 
and denotation givenness.) This type of anaphorical usage is not observed in the third person possessive 
pronoun in many other languages, such as English its or nya in standard Indonesian. Thus, the sentence 
“I like chicken leg meat, but not its breast meat’ cannot be accepted as the English counterpart of example 


(12). 
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independent of each other. In this section, we will examine the semantic function of each 
strategy, mainly based on MM sentences obtained as translations of target sentences 
from standard Indonesian and utterances observed in every day conversation. 

Hawkins (2015: Chapter 3) makes a distinction between four major usage types of the 
definite article the: anaphoric, immediate situational, larger situational, and associative 
anaphoric uses. 

The MM articles ni and tu are used in cases similar to the first two types, that is, 
anaphoric use and immediate situational use. In sentence (13), the two forms are used 
for making reference to the entity in the speech situation, in sentence (14), one of the 
two forms tu is used for making reference to the entity or situation mentioned in the 
previous discourse.” 


(13) Bole pinjam tu — / ni pulpen? 
may borrow ART.D / ART.P ballpoint.pen 


‘May I borrow that ballpoint pen?’ (elicited) 


(14) Ada parampuang gaga deng dua anak da masuk ke satu ruangan. kita 
exist woman beautiful and two child pst enter to one room  1sc 
langsung tahu sapatu ` parampuan itu. 
directly know who ART.D woman DEM.D 


“An elegant lady and two children came in the room. I immediately knew who 
the woman was: 


These two uses correspond with what Lyons (1999: 166, 198) calls “textual-situational 
ostension". According to Lyons, ^what these have in common is that the referent is im- 
mediately accessible? Lyons suggested that a primary distinction of definiteness should 
be made between textual-situational ostension and other usages. The former function- 
ally overlaps with demonstrativeness, and the others do not. A similar view is presented 
in many previous studies, such as Hawkins (2015: Chapter 3), Himmelmann (1996), and 
De Mulder & Carlier (2011: 528). 

Demonstratives are a well-known source of definite markers in many languages, as 
suggested by Heine & Kuteva (2002) and Lyons (1999) among others. De Mulder & Car- 
lier (2011: 528) suggest that the crucial semantic shift from demonstratives to the def- 
inite article is seen from direct reference that corresponds to the direct situational use 
and anaphoric use of Hawkins, to indirect reference, which corresponds to anaphoric 
associative use and larger situational use. 

Notwithstanding the distinct syntactic position in NP from the demonstratives, the 
uses of the articles in MM have not undergone a semantic shift and have not extended 


5(14) is a sentence obtained as a rough translation of sentence (i) below; an example of anaphoric use of the 
English definite article is given in Lyons (1999: 3). 


(i) An elegant, dark-haired woman, a well-dressed man with dark glasses, and two children 
entered the compartment. I immediately recognized the woman.... 
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beyond direct reference. Instead, indirect uses are covered by the third person possessive 
depe '3sc.Poss' in MM. In the anaphoric associative use of the, the NP refers to something 
associable to the referent of a previously mentioned NP, while in the larger situational 
use, the NP refers to something associable to the situation of the utterance itself. In 
both uses, the hearer is supposed to use shared general knowledge for identification; the 
hearer and the speaker need to know the referent is associable to the antecedent or the 
utterance situation in question. 


Sentences (15-16) are examples of anaphoric associative uses. 


(15) Kitalebe suka Australia daripada Jepang karna depe sayur-sayur lebe 
ısG more like Australia from ` Japan because 3sG.Poss vegetable.RED more 
sadap deng murah. 
tasty and cheap 


‘I like Australia more than Japan, because vegetables there are tastier and 
cheaper: (elicited) 


(16) Kita baru pulang dari pesta kaweng. Depe ` broidta pe tamang. 
1sG just come back from party wedding. 3sG.Poss bride 1sc Poss friend 


' have just come back from a wedding party. The bride was a friend of mine’ 


(elicited) 


Employment ofthe third person possessive depe for this use can be easily explained by its 
original meaning; the possessive depe includes de, the shortened form of the third person 
pronoun dia ‘3sc’. The pronoun dia may be used as an anaphor, and in the possessive, it 
indicates the presence of a whole to part relation between the referent of the pronoun 
and the matrix NP. 

From sentences (15) and (16) above, we can see that the semantic relation between the 
possessor and the head noun is not limited to the simple whole to part relation that is 
exemplified in sentence (8) and (9) shown in 82. There may be various relations, such as 
location, as seen in example (15), and occasion, as in example (16). 

However, the semantic range the possessive covers does not seem to perfectly overlap 
with that of anaphoric associative the. Consider example (17), which Lyons (1999: 3) gives 
as one of the examples of associative use of the English definite article. 


(17 Ihad to get a taxi from the station. On the way, the driver told me there was a bus 
strike. 


In sentence (18), a rough MM equivalent of sentence (17), the counterpart of the English 
definite NP does not receive any explicit marking, as seen in sentence (18). 


Example (16) is obtained as a rough MM equivalent of sentence (ii) below; an example of associative 
anaphoric use of English the given in Lyons (1999: 3).(ii) ‘I have just come back from a wedding party. 
The bride wore blue: 
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(18) Ni hari kita da nae taksi dari stasion. Di tenga jalang (*depe / *tu) 
ART.P day 1SG PFT ride taxi from station at middle way ^ 3sG.POSS  ART.D 
sopir se tau tadi ada cilaka brat. 
driver caus know before exist accident heavy 


‘I had to get a taxi from the station. On the way the driver told me there was a 
serious accident. (elicited) 


In this situation, we can reasonably associate the referent of sopir ‘the driver’ to taksi 
‘a taxi’, and that is the reason the NP undergoes the definite marking in English sen- 
tence (17), but that is not the case in MM. The reason may be that the semantic relation 
between sopir ‘the driver’ and the associated taksi ‘a taxi’ cannot be taken as a possessor- 
possessed, or whole to part relation, to the MM speakers; one of the MM speakers sug- 
gested that he could not use the possessive depe here because the driver possessed the 
taxi, not the reverse. This example may show the difference between the English defi- 
nite article in associative use and MM possessives; the former may indicate any type of 
association, while the latter exhibits some limitations which presumably are attributed 
to the original possessive meaning. At the present stage of our research, however, we 
do not have enough data to provide the precise condition where the possessive may or 
may not occur. 

The use of depe in example (19) and (20) overlaps with the "larger situational use" of 
the in Hawkins's classification, where the referent of the depe NP is associable to the 
utterance situation. Note that there is no clear antecedent of the possessive in these 
examples. 

In sentence (19), the NP depe cuaca refers to the weather of the place the speaker and 
hearer are located in. 


(19 Depe cuaca bae. 
3sG.POSS weather good 


"Ihe weather is nice (today). (spontaneous utterance obtained from daily 
conversation) 


The sentences in (20) are from a Facebook post. Example (20a) is the original Facebook 
post made with a picture of yams, and (20b) and (20c) are comments posted by two 
friends of the poster. In both comments, ubi ‘yam’ mentioned in the original post is 
marked by tu and depe, and the antecedent of depe is not explicitly mentioned. 


(20) a. Slamat pagi, panen ubi jalar serta menanam ulang. 
good.morning harvest yam spread after plant again 


‘Good morning, harvesting yams and then planting them again’ 


TWe might be able to infer that if the ‘possessed’ NP is animate and the ‘possessor’ NP is inanimate, the 
marking with depe may not be permitted, as it contradicts the concept of possession we intuitively would 
have. 
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b. Mantaap Beng pe besar-besar kang tu depe ubi?’ 
great Beng very big.RED ITR ART.D 3SG.POSS yam 


‘Great Beng the (lit. the its) potatoes are very big, aren't they?’ 


c. Banyaktu depe batata ada panen. 
many ART.D 3SG.POSS sweet.potato PST harvest 


‘Lots of the (lit. the its) sweet potatoes were harvested? 


The lack of a clear antecedent? in sentences (19) and (20) shows that the form depe does 
not function as the possessive marker. Instead, we can claim that the form depe plays a 
similar semantic role to the larger situational use of the, whichever label we give to it in 
MM grammar. In this use, the referent is identified by two processes: one is identifying 
the nature of the "shared" larger situation intended by the speaker, and the other is 
identifying the referent using the "shared" knowledge that presupposes the existence of 
the referent in the situation (Hawkins 2015). 

A similar type of development from the possessive to the definite marker is observed 
in other languages that are not genetically related, such as Amharic (Rubin 2010) and 
Yucatec Maya (Lehmann 1998: 86-88), as well as colloquial Indonesian, as mentioned in 
83. This development can be explained by an affinity between the association and indi- 
cation of the part-whole relation. Hawkins (2015: 123-124), in discussing the similarity 
of associative anaphoric and larger situational use, claims that “(T)he notion ‘part-of’ 
seems to play an important role in defining the number of possible associates. The trig- 
ger (of the association) must conjure up a set of objects which are generally known to be 
part of some larger object or situation.” (For a more recent and precise discussion of the 
development from possessive to definite marker, see Fraund 2001; Gerland 2014; 2015). 


*The commentator uses the spelling of ber? and dp for besar-besar and depe, respectively, in her original 
post. 

?We asked the commenter to identify the antecedent of depe in sentence (20)c several times, but her answers 
were not consistent. Her response may show that the referent of the antecedent is not a concrete entity that 
can be clearly mentioned. We might be able to insist that the third person pronoun de refers to the implied 
“shared situation”, but the claim may not be accepted, because the third person pronoun dia, from the long 
form of de in depe, may not refer to the situation or proposition. Consider the three pairs of sentence (i). 
A situation or proposition can be referred to only by the demonstrative begitu, not by the third person 
pronoun dia. 


(i) a. Albert so nya mo pusing deng orang laeng pe emosi. So bagitu Albert pe 
Albert PFV NEG want bothered with person other poss emotion PFV like.that Albert poss 
kalakuan. 
behavior 
“Albert doesn’t want to be bothered with other people’s feelings. The character of Albert is 
like that’ (elicited) 

b. Albert so nya mo pusing deng orang laeng pe emosi. “Dia Albert pe kalakuan. 
Albert PFV NEG want bothered with person other poss emotion 3sc Alert Poss behavior 


‘(Intended meaning) Albert doesn't want to be bothered with other people's feelings. That's 
the character of Albert: 
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Other varieties of Malay exhibit similar developments to a greater or lesser extent. 
Adelaar & Prentice (1996: 675) suggest that the use of the short form of the demonstra- 
tives ni and tu as well as forms such as pu or pun (derived from punya ‘have’ as pos- 
sessive marker) are among several morphosyntactic features shared among trade Malay 
varieties, which Adelaar & Prentice (1996: 675) call Pidgin-Derived Malay (PDM) vari- 
eties. Regarding the development of demonstratives into the definite markers, Adelaar 
(2005: 212-217) points out the anaphoric use of the short forms of demonstratives tu and 
ni in Ambon Malay and Cocos Malay; they also underwent semantic bleaching similar 
to that of MM. Similar types of development are reported in both Papuan Malay (Kluge 
2017: 384-388) and Ternate Malay (Litamahuputty 2012: 263, 277). 

The development of the possessive into a definite marker is also observed in collo- 
quial Indonesian, in which the third person possessive enclitic =nya is used to indicate 
identifiability, exhibiting functions similar to MM depe in associative anaphoric use and 
larger situational use (Englebretson 2003: 161-168). A rather different distribution was 
observed in Baba Malay, spoken in Malaka and Singapore by "Strait-born" Chinese. In 
Baba Malay, the articles ini and itu cover larger semantic domains, including associative 
anaphoric use and larger situational use (Thurgood 2001: 477-480), although the third 
person possessive suffix -nya also has similar functions to the articles (Ihurgood 1998: 
132-135). 


4 Determiners and possessives in elicited procedural text 


4.1 Method 


In this section, we will see larger texts elicited by a short cooking video as stimulus to 
confirm the syntactic and semantic functions of the two strategies outlined in the pre- 
vious sections. The advantage of employing this method for elicitation is that (i) this 
type of non-linguistic stimulus enables us to collect more naturalistic data without the 
influence of a medium language, and (ii) the reference tends to be clear in the text ob- 
tained through this method when compared to purely spontaneous utterance in which 
the referent of each NP may not always be easily identified (see Majid 2012 for details of 
elicitation methods using stimulus materials.) 

The video employed as stimulus here is titled Tinutuan 'Manadonese porridge’. The 
video was shot by one of the authors and is available from https://youtu.be/cyJanYZjXoo. 
We asked four speakers of MM (H, I, D and A) to watch the video and give a commentary 
in MM. In the video, the main dish tinutuan ‘Manadonese porridge’ and the side dishes 
tahu goreng ‘fried tofu’ and dabu-dabu ‘chili sauce’ are cooked. The outline of the cooking 
process is shown in Table 3. 
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Figure 1: Tinutuan ‘Manadonese porridge’, tahu goreng ‘fried tofu’ and dabu- 
dabu “chili sauce”. 


Table 2: MM speakers who provided the narrative 


Name Age From Mother tongue 
H 65 Beo, Talaud Talaud 

I 36 Beo, Talaud Manado Malay 
D 34 Sonder, Minahasa Manado Malay 
A 55 Tomohon, Minahasa Tombulu 


Table 3: Outline of the cooking process 


Scene 1: showing ingredients 

Scene 2: cut and peel hard vegetables such as yam and pumpkin 

Scene 3: put the vegetables and rice into a pan and heat them 

Scene 4: cut and wash the leafy vegetables 

Scene 5: mash the pumpkin in the pan, put the leafy vegetables 
in the pan and mix all the ingredients 

Scene 6: prepare the side dish tahu goreng (fried tofu) 

Scene 7: prepare dabu-dabu (chili sauce) 

Scene 8: serve the dish 
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4.2 Results 
4.2.1 Referent and general referential strategies observed 


There are 45 entities mentioned in the narrations of the four speakers; the range of en- 
tities that each speaker mentioned varies depending on the speaker, and the term for 
the same entity may vary among speakers, too. The referents can be grouped into the 
semantic categories below. 


* The speaker (1 type): Isye 


* The name of dishes (3 types): tinutuan ‘porridge’, tahu goreng ‘fried tahu’, and 
dabu-dabu or laburan ‘chili sauce’ 


e Ingredients (1 type): bahan-bahan ‘ingredients’ 


e Base ingredients, i.e. root vegetables and rice (6 types): ubi ‘potato’, batata or ubi 
manis ‘sweet potato’, ubi kayu ‘cassava’, sambiki ‘pumpkin’, milu ‘corn’, beras 
merah, beras ‘rice’, aer ‘water’ 


* Leafy vegetables (6 types): sayor ‘leafy vegetables’, bayam ‘amaranth vegetable’, 
kangkung ‘water spinach’, gedi ‘aibika leaf’, kukuru, balakama ‘basil’, sarimbata, 
baramakusu, goramakusu ‘lemongrass’ 


e Ingredients for side dishes (8 types): tahu ‘soybean curd, tofu’, bawang merah ‘shal- 
lot’, bawang putih ‘garlic’, garam ‘salt’, tomat ‘tomato’, rica ‘chili’, ikan roa ‘dried 
fish’, minyak kelapa ‘coconut oil’ 


e An attribute or a part of ingredients (4 types): kuli ‘skin’, daong ‘leaf’, isi ‘contents, 
edible part of vegetable’, warna (kuning) ‘(yellow) color’ 


* Cooking tools and so on (6 types): blanga/panci ‘pan’, kompor ‘stove’, mangko 
‘bowl’, pantumbu ‘pestle’, cobe-cobekan ‘mortar’, piso ‘knife’ 


* Body parts of the cook (2 types): tangan ‘hand’, jare ‘finger’ 


* Others (8 types): cacing ‘worm’, vitamin ‘vitamin’, kelihatan ‘appearance’, nama 
‘name’, priksaan ‘test’, hasil ‘result, pedis ‘spicy (n)’, orang ‘person’ 


The text length and number and varieties of the referents mentioned vary among the 
speakers. Table 4 shows the number of words and referents included in each narrative. 
Each referent can be expressed by either a personal pronoun, a demonstrative pronoun, 
or a lexical NP. Table 5 counts the occurrences of each strategy. 

It should be noted that the argument of the predicate is not expressed when it is salient 
in discourse; category zero counts such arguments. 
The actor (the cook) is not mentioned at all in three of the four narratives and is men- 
tioned only once (by the third person singular pronoun dia) in the remaining narrative. 
Other non-agent arguments are also often not expressed; a series of cooking processes 
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Table 4: The number of words and referents included in each narrative 


Speaker Words included Types of referent mentioned 
I 444 41 
H 525 38 
D 336 27 
A 478 32 


Table 5: Occurrence of each strategy 


Speaker Zero Personal Demonstrative Lexical NP 
pronoun pronoun 

I 131 8 18 105 

H 111 5 12 95 

D 112 1 4 78 

A 117 6 8 89 


is expressed by a co-ordinate clause, and the entity mentioned in the first clause is not 
expressed in the clauses that follow it. Consider sentence (21), which consists of coor- 
dinate clauses expressing a series of actions processing garlic. Here, the actor does not 
occur throughout the sentence, and the patient, bawang putih ‘garlic’ occurs only once 
in the first clause, but not in the three clauses that follow. 


(21) Aa kase ancor bawang <me->'° bawang putih so kase ancor ¢ lagi iris-iris d 
ITJ CAUS crush onion <re-> onion white PFV caus crush again slice 
lagi hh campur ¢ di tahu. 


again ITJ mix at tofu 


'Aa...(she) crushes the onion...the garlic, after crushing, (she) will slice (it) and 
mix (it) with tofu’ (speaker H 37-38) 


In what follows, we focus on how lexical NPs are marked with the articles and/or the 
possessive. A lexical NP may occur (i) in unmarked form, that is, a bare NP, (ii) with 
the article tu or ni, (iii) with possessives depe or dia pe, (iv) with both the article tu and 
the possessive, (v) with a postposed demonstrative, or (vi) with =nya, the third person 
singular possessive enclitic used in standard Indonesian. 

Most of the possessives are that of the third person singular depe in the text; the text 
includes only one example of the lexical noun possessor, sambiki le pe kuli [pumpkin 
also Poss skin] ‘pumpkin’s skin’. 

Table 6 shows the occurrence of the article and the possessive construction. 


In this utterance, the speaker started to say bawang merah ’shallot’, and then corrected herself saying 
bawang putih ‘garlic’. 
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Table 6: Occurrence of the determiner and the possessive construction 


Sum ofthe Unmarked ART POSS tu DEM -nya Others 
lexical NPs (tu/ni) * POSS  Pre-posed  Post-posed 
I 105 62 32 (31/1) 8 2 0 0 0 0 
H 95 45  3(12) 41 0 0 28 3 1 
D 79 47 19 (14/2) 9 2 3 1 0 1 
A 88 77 6 (6/0) 3 0 0 1 0 1 
*(nit+ ini) 


As observed in §2, in current MM the pre-head noun position is much more frequently 
filled by the article than by a demonstrative. This data confirm the observation; we can 
see only 3 instances of pre-head noun demonstratives compared to 65 instances of ar- 
ticles. We also mentioned the variation in form of the third person singular possessive. 
The short form depe occurs much more frequently (66 examples) than the long form dia 
pe (3 examples). 

The individual narratives exhibit considerable variation in the frequency with which 
each speaker uses the two strategies — the determiner and the possessive. For example, 
speaker I prefers to use the article, while speaker H prefers the possessive depe. Speaker 
D uses both in similar frequencies, while speaker A rarely uses either of the markers. 

Notwithstanding the difference in preference in using each device, the use in the text 
maintains the basic semantic function of the determiners and the possessive, which we 
have shown in §3; the articles mark a textual-situationally given referent, while the pos- 
sessive depe or dia pe marks a referent associable to a given referent or utterance situa- 
tion. 

Table 7 shows the distribution of NPs with an article and the possessive depe in a 
textually accessible environment. 


Table 7: The distribution of the articles and possessive depe 


Articles Possessive depe 


Total 59 61 
Not textually accessible 3 30 
Textually accessible 56 31 


Because of the nature of the text, most of the referents are visible to both the speaker and 
addressee.!! That makes it difficult to verify how direct situational accessibility affects 
both devices. The fact that a considerable number of NPs were not marked by either 
of the devices, however, suggests that situational accessibility is not a crucial factor for 
either of the markings. 


"The addressee in any given narrative is whichever of the authors was present at the time of recording. 
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Regarding textual-accessibility, we can see a clear difference of frequency between the 
articles and possessives. In the 59 occurrences of the NP marked with articles in total, 56 
refer to a textually accessible — in other words, previously mentioned, entity. 

In contrast to the articles, as expected by the observation of 83, textual-accessibility 
does not influence the use of the possessive depe. 

In the following sections, we will see the details of how each strategy works in the 
text. 


4.2.2 Textually accessible use of the article 


As mentioned above, in almost all the occurrences the NPs marked with an article refer 
to a textually accessible referent. The frequency of tu is far higher than that of ni, as seen 
in Table 6, which supports Prentice's view that ni is semantically marked (see 83). From 
the text obtained by the experiment, though, we could not clearly see the functional 
difference between the two articles. 

As mentioned in 82, the determiner tu occurs with core arguments (S, A, and P). How- 
ever, not all textually given S and P referents are marked by the determiner. Table 8 
shows the frequency of use of the determiner for textually given S and P referents.! 


Table 8: The frequency of the form tu and depe marking for a given S and P 


referent 
Speaker Textually accessible ASP Marked by ART 
I 55 38 (69%) 
H 30 3 (10%) 
D 35 21 (60%) 
A 35 6 (20%) 


The preference varies among the speakers. Speaker I and D more frequently used tu 
than the other two speakers. They are younger than the other speakers, and so this may 
represent a change in progress. 


4.2.3 The use of the possessive pronoun 


As shown in 82 and $3, the possessive covers anaphoric associative use as a part of its 
possessive meaning and also covers the larger situational use of Hawkins (2015) as a 
result of semantic change. 

The obligatory marking of the possessor mentioned in 82 is attested by the narratives. 
Sentence (22) is a typical example. 


No given A occurs in the four texts. 
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(22) Sertaso ta-kaluar depe kuli, mo kupaslei tu  sambiki. 
after PFV PAsS-peel 3sc.Poss skin FUT peel again ART.D pumpkin 


"After all the peel has been removed, (then she) will peel the pumpkin, too. (H 13) 


The form depe in (22) retains its possessive meaning and indicates that the referent of the 
whole NP is associable to the referent of previously mentioned NP. In actual sentences, 
the associative use and larger situation use cannot always be separated clearly. 

Consider sentence (23). This is the first sentence in scene 6 (preparation of a tofu 
dish), and the antecedent of depe in the NP depe tahu ‘the tofu’, is not clear, or is at least 
unavailable in clauses that directly precede sentence (23). 


(23) Skarang mo bekeng depe tahu. tahu taro di panci. 
now FUT do 3sG.POSS tofu tofu put at pan 


"Now (we) want to make the tofu. Put the tofu in the pan? (I 052) 


Sentence (24) provides a similar example. This is the first sentence in scene 7 (preparation 
of chili sauce), and the antecedent of depe in the NP depe laburan ‘the sauce’, is not clear, 
or at least is unavailable in the clauses that directly precede it. 


(24) Itu mo bekeng depe laburan. 
that FUT make  3sG.Poss sauce 


‘There (she) is going to cook the (its) sauce’. (D 80) 


According to the speaker, in both cases, the possessor is the main topic of the whole text: 
tinutuan ‘Manado porridge’, fried tofu and chili sauce always come together with the 
porridge as a side dish and can be considered a part of the dish. 

The dish tinutuan does have prior mention and we could therefore say that sentences 
(23) and (24) are examples of anaphoric associative use. But the prior mention of tinutuan 
is made in the very beginning of the whole narrative — far from sentences (23) and 
(24) (51 and 78 clauses away from each depe NP, respectively). It is therefore difficult 
to consider the NP tinutuan to be antecedent of the possessive depe. It may be more 
plausible to think that the referent of depe NP is associable with the larger situation in 
which the utterance was made, that is, watching, and talking about, the cooking process 
of tinutuan. 

Table 9 shows the frequency with which each speaker uses depe; each use is classified 
into those that have an antecedent available in directly preceding clauses - in other 
words, associative anaphoric use and larger situational use. 

Differences among speakers are observed in their use of larger situational depe. 

As seen in Table 9, one of the four speakers (Speaker H) showed a marked preference 
for wider topic depe, while Speaker I did that to a lesser extent. Speaker H’s distinct use 
of depe is clearly seen in the beginning of his narrative, where he introduces ingredients 
immediately after the title tinutuan ‘Manado Porridge’ is shown. Sentence (25) shows 
that part; here, speaker H marked the NP expressing ingredients with depe '3sc.Poss'. 
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Table 9: Frequency of the form depe 


Lexical NP Possessive 


Sum  Associative anaphoricuse Larger situational use 


I 105 10 6 4 
H 96 43 13 30 
D 78 12 3 9 
A 89 5 4 1 
(25) a. Mo bekeng masakan nama-nya tinutuan. 
want make food name-3sG.POSS(BI) tinutuan 
‘(She) wants to cook food named tinutuan: (Speaker H: 01) 
b. Adoe pe sadap skali ini, aah ini batata, depe batata, 
ITJ ITJ ITJ delicious very this rr) this sweet.potato, 3sG.POSS sweet.potato, 
depe ubi. 


3SG.POSS yam 


‘Oh, it is very delicious, this is sweet potato, the sweet potato, the yam 


Unlike H, the other three speakers introduce the ingredients without any marking. In 
sentence (26), speaker I describes the same scene. 


(26) Bahan-bahan, bete, ubi kayu sambiki milu... 
ingredients taro sweet.potato pumpkin corn 


‘Ingredients...taro, sweet potato, pumpkin, and corn... (Speaker I: 02) 


Differences among speakers are also seen in the description that follows (25) and (26), 
respectively, which explains the cooking procedure. Sentence (27) is a description that 
follows sentence (25). Speaker H keeps employing depe for referring to the ingredients 
given in the previous part of his utterance; here, one of the ingredients batata ‘sweet 
potato’ is marked with depe. 


(27) Aah sekarang depe batata mo di-kupas kase kaluar depe kuli. 
ITJ now 3SG.POSS sweet.potato FUT PASS-peel give go.out 3sc.Poss skin 


‘Ah, now (she) is going to peel the potato, peel off the skin’ (Speaker H: 11) 
In contrast to that, speaker I employs tu to mark all the ingredients that were given in 


the preceding part of the utterance. Sentence (28) is a part of the description that follows 
sentence (26). 


(28) Pertama kase bersi tu ` bete, kupas depe kuli. 
first caus clean ART.D taro peel 3sG.Poss skin 


“First, clean the taro, and peel its skin: (Speaker I: 18) 
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It should be noted that all the speakers use both strategies to a greater or lesser extent. 
Speaker H, who very frequently uses depe, also uses tu twice to mark a textually acces- 
sible referent, as in sentence (22), while speaker I, who uses tu for most of the textually 
given referents, also employs larger situational depe, as seen in sentence (23) above. 

The variation observed in the frequency of each device among speakers, therefore, is 
not caused by differences in the referential system each of them employs, but by which 
strategy they prefer to code an anaphoric relation of a referent in the discourse and 
discourse situation. Speaker I prefers to code a relation of a referent in the previous 
discourse and therefore uses anaphoric articles more frequently, while speaker H prefers 
to relate a referent to a shared situation told by the whole discourse and therefore uses 
the possessive more frequently. 

As mentioned in 82, the article and possessive may co-occur in one NP. The elicited 
text includes three examples of such a co-occurrence. Example (29) below and exam- 
ple (10) above from the elicited text and (20)b and (20)c above, which are spontaneous 
utterances, show this compatibility. In sentence (29), the article tu indicates a textual- 
situational accessibility and the possessive depe indicates that the referent can be asso- 
ciated with the shared larger situation. 


(29) Kase ancor tu depe | sambiki supaya dapa lia warna kuning. 
cause smash ART.D 3sG.POss pumpkin so.that get see color yellow 


'(We) smashed the pumpkin, so that we could see the yellow color: (I 42) 


This suggests that the semantic domain each device covers is not exclusive to the other 
and belongs to intrinsically different semantic dimensions; one may mark the referent as 
textual-situationally accessible and, at the same time, as identifiable through association 
with the larger situation shared between the interlocutors. 


5 Summary and discussion 


We have shown referential strategies of MM, with special focus on how a lexical NP is 
marked according to the information status of the referent. MM has two strategies to 
mark so-called “definiteness”: articles and the third person singular possessive depe. The 
articles are derived from demonstratives and are used for direct situational reference 
and anaphoric reference, while the possessive is used for references in which some kind 
of association is required for identification, which corresponds to anaphoric associative 
use and larger situation use of English in the classification of Hawkins (2015). 

Both devices still retain their original semantic functions. The semantic domain of 
the articles does not extend beyond textual-situational accessibility, a direct semantic 
extension of the demonstratives; while the possessive does not cover all the "associative" 
relations that would be expressed by the definite NP in English, as seen in 83. 

Demonstratives are a well-known source of definite markers in many languages. MM 
articles have established a syntactic position in NPs separated from the postposed demon- 
stratives, and especially tu (derived from the distal demonstrative) has undergone seman- 
tic bleaching. We could expect that the use of the articles might be extended further to 
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indirect reference, such as anaphoric associative or larger situational use (Hawkins 2015). 
This cross-linguistically plausible scenario, however, seems to be blocked by the seman- 
tic extension of the possessive depe, at least in the present stage. 

The article tu and possessive depe may co-occur in one NP. This fact suggests that 
the semantic domain which each form covers is not exclusive to the other and belongs 
to intrinsically different semantic dimensions; one may mark the referent as textual- 
situationally accessible and, at the same time, as identifiable through association. 

A very similar type of referential system with demonstratives and possessives is ob- 
served in Cirebon Javanese, a genetically related language (Ewing 1995; Ewing 2005). In 
Cirebon Javanese, as in MM, the determiners derived from the demonstratives mark di- 
rectly shared identifiability, and textual-situational accessibility, while the possessive 
suffix -é marks identifiability through indirect association. The two devices can fre- 
quently co-occur in one NP, because they “are not in some sort of complementary dis- 
tribution” (Ewing 1995: 80). 

Similar, but apparently more grammaticalized patterns of marking are observed in 
Fehring, a dialect of North Frisian. In Fehring, according to Lyons (1999: 161ff), which is 
based on the description of Ebert (1971a,b), and De Mulder & Carlier (2011), the strong, 
less grammaticalized, article is used for textual-situational accessibility, while the weak, 
more grammaticalized, article is used to indicate anaphoric association, unique entity, 
and generic entity (De Mulder & Carlier 2011: 529). The two articles exhibit complemen- 
tary distribution in the pre-head noun determiner slot. The result of definite marking 
development in MM may be the pattern observed in Fehring. 

Another possible development may be that one of the two strategies becomes more 
dominant than the other. As shown in 84, among the four speakers who have provided 
narrative data, one elder speaker prefers to use the possessive, while the two younger 
speakers prefer to use the articles. From this generational difference, we might predict 
that the article will become dominant and extend its semantic domain to indirect refer- 
ence in the future. 

MM is rapidly obtaining native speakers. As it goes in this direction, processes of stan- 
dardization or homogenization could be expected to affect the marking of definiteness. 
The process should be monitored through ongoing research. 


Abbreviations 

1,2,3 the 1st, 2nd, 3rd person NEG negation 
ART.D  distalarticle PASS passive 

ARTP proximal article PFT perfect 

CAUS causative PFV ` perfective 
DEM.D distal demonstrative PL plural 

DEM.P proximal demonstrative POSS possessive 
EXCL exclusive PST past 

FUT future PTC discourse particle 
ITJ interjection RED reduplication 
ITR interrogative SG singular 
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This paper discusses the information structure in Sembiran Balinese, an endangered, con- 
servative mountain dialect of Balinese. It presents the first detailed description of the ways 
topic, focus and frame setter in this language interact with each other and with other ele- 
ments in grammar. It is demonstrated that Sembiran Balinese employs combined strategies 
that exploit structural positions, morpho-lexical and syntactic resources in grammar. The 
description is based on a well-defined set of categories of information structure using three 
semantic-discourse/pragmatic features of [+/-salient], [+/-given] and [+/-contrast]. This 
novel approach allows for the in-depth exploration of the information structure space in 
Sembiran Balinese. The paper also highlights the empirical-theoretical contributions of the 
findings in terms of the significance of local socio-cultural context, and the conception of 
information structural prominence in grammatical theory. 


1 Introduction 


Sembiran Balinese is one of the endangered conservative dialects of Balinese (i.e. Bali 
Aga, or Mountain Balinese). It is spoken by around 4,500 speakers in the mountainous 
village of Sembiran in northern Bali.! Sembiran Balinese has a similar morphosyntax to 
Plains, or Dataran Balinese, but it is different in that it lacks the speech level system 
characteristics of Plains Balinese.” The noticeable difference is therefore related to the 
lexical stock, including the pronominal system, which is discussed in 82. 


ISBD should be distinguished from the Plains Balinese dialect, which lexically has been influenced by many 
other languages, namely, Javanese, Sanskrit, English, Arabic and Indonesian. Morphologically, both dialects 
have slight differences in prefix and suffix systems, but syntactically, both dialects have the same syntactic 
marking typology. 

?Sembiran Balinese lacks the elaborate speech level system of Plains Balinese; however, the data suggests 
that there has been considerable contact with Plains Balinese, with the speakers being bilingual and fully 
aware of the politeness and speech level system. For example, the use of code-switching with the polite 
pronoun tiyang in addition to code-switching with Indonesian words was found. 


I Wayan Arka & I Nyoman Sedeng. 2018. Information structure in Sembiran Ba- 
Mil linese. In Sonja Riesberg, Asako Shiohara & Atsuko Utsumi (eds.), Perspectives on 


information structure in Austronesian languages, 139-175. Berlin: Language Science 
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Sembiran Balinese is relatively underdocumented compared to Plains Balinese. Previ- 
ous studies on Sembiran Balinese include studies by Astini (1996) on consonant gemina- 
tion and by Sedeng (2007) on morphosyntax. A more comprehensive documentation of 
Sembiran Balinese and other Bali Aga varieties is needed. 

The present paper on information structure in Sembiran Balinese primarily builds on 
Sedeng's (2007) work. Our paper is the first thorough description of the information 
structure in Sembiran Balinese, based on a well-defined set of categories of information 
structure using three features ([+/-salient], [+/—given] and [+/-contrast]). The adopted 
novel approach makes it possible to map out the information structure in Sembiran Bali- 
nese in considerable detail and depth, revealing the intricacies of the semantics, syntax 
and pragmatics involved. The data provides fresh empirical evidence not only for the 
distinction of the known major categories of rocus vs. TOPIC, but also for the subtle dis- 
tinction of FRAME SETTER vs. (CONTRASTIVE) TOPIC. Typologically, certain aspects of the 
information structure patterns observed in Sembiran Balinese are consistent with the 
patterns found in Plains Balinese (Pastika 2006) and other Austronesian languages with 
voice morphology, such as Pendau, a language in central Sulawesi (Quick 2005; 2007). 

The paper is structured as follows. Grammatical relations and related salient features 
of Sembiran Balinese are discussed in 82. This is followed by an overview of information 
structure and the proposal of decomposing topic and focus into three features ([salient], 
[given] and [contrast]) in 83. The main discussions with the presentation of the data 
and analysis are presented in §4 for topic, 85 for focus and $6 for frame setting and 
left-periphery positions. The conclusion and further remarks are provided in the final 
section. 


2 Grammatical relations in Sembiran Balinese in brief 


Sembiran Balinese is a conservative dialect of Balinese. The conservative nature is first 
evident by the retention of an archaic Austronesian feature already lost in Plains Balinese, 
namely the pronouns (a)ku/-ku and engko/-mu, as shown in Table 1. 


Table 1: Pronominal systems in Sembiran Balinese 


PERSON FREE PRONOUN BOUND GENITIVE PRONOUN 
1 aku, oké, kaka, icang -ku 

engko, cahi, nyahi -mu 

iya -a 


The pronouns in bold in Table 1 are those that are also shared with Plains Balinese. The 
bound pronouns in their genitive function in the nominal typically appear with the nasal 
ligature -n and the definite suffix -e, leading to the morphologically complex bound forms 
of -kune, -mune and -ane for the first, second and third persons, respectively. An example 
is given in (1): 
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(1) Engkosa mutang sa engko kén panak-mu-n-e. 
2 PART MID.debt PART 2 to child-2-LIG-DEF 


"You still owe your son/daughter (a ritual)? (Sedeng 2007) 


There is also an intriguing difference in which certain intransitive verbs in Sembiran 
Balinese use the Actor Voice (AV) prefix N- with a prenasalised segment retention rather 
than the middle (MID) voice ma- used in Plains Balinese, as shown in (2). This further 
indicates the conservative nature of Sembiran Balinese given the fact that a prenasalised 
segment is an ancient and widespread feature of Austronesian languages (Blust 2013: 
224); however, it should be noted that this prenasal segment retention only applies for 
intransitive verbs. In transitive verbs, the nasal property of the AV prefix N- assimilates 
with the initial consonant, resulting in no prenasal segment, e.g. teguh ‘bite’ — neguh 
(<N-teguh) ‘av-bite’. 


(2) Sembiran Plains 
Root Balinese Balinese ` Gloss 
a. besen mbesen mabesen ‘send message’ 
b. pupur mpupur mapupur ‘make up with powder’ 
b. salin ^ nsalin masalin ‘change dress’ 
c.kisid  ngkisid makasid ‘move’ 


The morphosyntax of Sembiran Balinese is exactly like Plains Balinese. Following the 
conventions of language typology (Comrie 1978; Dixon 1979; Croft 2003; Haspelmath 
2007; Comrie 2005; Bickel 2011), grammatical relations are represented using the abbre- 
viated labels, as described in (3). These labels, particularly A vs. P and G vs. T, are distin- 
guishable by certain semantic entailment properties (Dowty 1991; Bickel 2011; Witzlack- 
Makarevich 2011; among others). The same labels are used in this paper when the argu- 
ments alternate, e.g. the same label A is used for the most actor-like argument in the 
active structure (i.e. core A argument) and in its passive counterpart, which is grammati- 
cally an oblique A. When necessary, a specific semantic role is specified for clarity, e.g. P: 
goal, meaning a goal of a three-place predicate that is treated as P as it enters a transitive 
structure. 


(3) Grammatical functions: default generalised semantic relations 

= sole core argument of an intransitive predicate 

= most actor-like argument of a bivalent transitive predicate 
most patient-like argument of a bivalent transitive predicate 
- most goal/recipient-like argument of a trivalent predicate? 
- theme of a trivalent predicate 


a QO > o 
T 


Sembiran Balinese also exhibits a grammatical sUBJECT or PIVOT, which plays a role in 
complex clause formations, such as control and relativisation. The voice system regulates 
the selection of a particular role as a pivot, which may also bear a particular discourse 


Based on applicativisation in Balinese, G is also the generalised role for a source/locative-like argument. 
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function of topic or focus. In the following examples, which show syntactic control, the 
verb mati-ang ‘dead-causE=kill’ is in the UV form in (4a). P is selected as the pivot and 
is therefore controlled by (i.e. understood as the same as) the matrix subject engko. In 
contrast, in (4b), because the verb is in the AV form, the A argument is selected as the 
pivot and understood as the same as the matrix subject. 


(4) a. Saking engko dot [ _ mati-ang oké]? 
really 2sG want ^ uv.dead-cAvus 1sc 
"You really want me to kill you?' 
b. Glema-néné nagih [  ngmati-ang i rangsasa]. 
person-DEF want AV.dead-cAUs ART giant 
‘This person wants to kill the giant’ (Sedeng 2007: 135) 


In terms of word order, Sembiran Balinese is an A/S-V-P (or SVO) language, with an alter- 
native order reflecting different information structure. Sembiran Balinese is like Plains 
Balinese in its phrase structure, which are schematised informally in (5). 


G) 
CP (Extended Clause) 


bn cu 


XP CP (Extended Clause) 


(Frame/ContrTopic) Fw 


XP C 


(Focus) P s 


C IP(Finite Core Clause) 


SC e 


Pivot T 


Date 


I XP:Predicate 


The predicate is not necessarily verbal; hence, XP (with X being any lexical category). 
The grammatical subject/pivot is part of the finite core clause structure, precisely in 
Spec, with the IP not shown in (5). 

A sentence can have units placed sentence-initially. This type of sentence is analysed 
as having an extended clause structure. Formally, in terms of X-bar syntax, this extended 
structure has a unit that is left-adjoined to the clause structure. The adjoined element 
bears the pragmatically salient discourse functions (DFs), frame/contrastive topic and 
contrastive focus, typically in that order. The left-most sentence's initial DF position is 
often called a left dislocated or detached position. The focus position is closer to the core 
clause (IP) structure. It is called the Pre-Core Slot in Role and Reference Grammar (RRG) 
(Van Valin 2005; Van Valin & LaPolla 1997). It is a position in [Spec, CP] in terms of the 
X-bar syntax adopted here. 
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Evidence for the structure shown in (5) is based on the following facts. First, there is 
evidence associated with interrogatives with question words (QW) in free clauses. The 
QW focus can appear in situ or can be fronted. When fronted, it must come in [Spec, CP] 
linearly before the core clause structure (IP). This is exemplified in (6a). In this sentence, 
the subject cening appears in its position within IP; however, the subject can be fronted 
marked with a topicaliser en (buat) ‘as for’ as in (6b), where it appears before the QW 
focus expression buwin pidan. Note that the fronted constituent (en) cening is precisely 
a contrastive topic. Crucially, this contrastive topic with the explicit marking with en 
cannot come after the QW focus, as can be observed from the unacceptability of (6c). 


(6 a. [[Buwin pidan]p,. [cening lakar mlali]yp]cp? 
again when [kid FUT MID.go.sightseeing 
"When again will you go (there)?' 
b. [[(En) cening]yop. [buwin pidan]goc [_ lakar mlali]rp]cp? 
as.for kid again when FUT MID.sightseeing 
‘As for you, kid, when again will you go (there)? 
c. *[[buwin pidan]foc [en cening}top, [__ lakar mlali]rp]cp? 


Additional evidence is based on finite complement clauses. Complement clauses are 
structurally CP with QWs like pidan ‘when’, apa ‘what/if’, ken ‘which’ and nyen ‘who’ 
that can function like complementisers, appearing as part of the CP taking the finite 
clause, as in example (7a). An important point to note from (7a) is that the adverbial 
phrase buwin mani ‘tomorrow’ is part of the complement clause CP, as in the partial 
phrase structure tree shown in (7b). While appearing before apa, it is an adjunct of the 
embedded clause, not of the matrix clause. The matrix clause has its own temporal ad- 
junct, namely ibi ‘yesterday’. Also note that the adverbial buwin mani is fronted, result- 
ing in a focus interpretation that is indicated by capital letters in the free translation. 


(7) a. Meme ibi ntakon [buwin mani apa [ia teka mail len, 
mother yesterday av ask again tomorrow if 3sG come here 


"Mother yesterday asked whether TOMORROW he would come here: 


AdvP C 
buwin mani C IP 


apa ii 


ia teka mai 
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The evidence that there is a focus position associated with CP positions before IP is based 
on the exclusive focus marker ane, which is also a relativiser. The syntactic constraint is 
that it must also be associated with the pivot. Structurally, this means that the presence 
of (a)ne requires that the position [Spec, IP] and the positions before it (i.e. [Spec, CP] 
and/or C) must be associated with the pivot. Hence, sentence (8b) is correct when ane is 
used to augment the focus expression icang ba in (8b). In both, the A argument icang is 
the focus and the pivot, as evidenced by the form of the verb, which is in the AV form. 


(8 a. Iya]contrTop: [[icang ba]contrgoc i [__i mehang nasilip]cp. 
3sc 1SG PART PIV AV.give rice 


“As for him, I am the one who provided meals: 


b. [Iya]contrTop: [icang ba]ContrFoc i [ane í [i mehang nasi]y]cp. 
3sc ISG PART REL PIV AV.give rice 


‘As for him, I am the one who provided meals: 


c. "[Iya]contrTop ane i [icang ba]ContrFoc i [i mehang nasi]jp]cp. 
3sc REL  1SG PART PIV AV.give rice 


“As for him, I am the one who provided meals: 


An attempt to mark the left-most NP (iya) with ane, as in (8c), is ungrammatical. This is 
because such marking results in a structure with an intervening argument icang, which 
is referentially distinct from the ane-marked NP (iya). Given the requirement of ane, it 
causes two referentially different NPs to compete for the pivot argument. 

Finally, it should be noted that there may be units in the left periphery positions that 
are not necessarily arguments of the main verb. They are represented by XP in (5). For 
example, the verb ngamah ' AV.having meals’ in (9) appear in the left periphery position. 
Syntactically, it is not a dependent unit of the predicate mehang "AN give. It functions 
as a frame or presentational topic (cf. Lambrecht 1994: 177-181); that is, it introduces an 
event of “eating”, evoking and delimiting certain referents in the discourse, including nasi 
‘rice’, which is the new focus. 


(9) [En buat ngamah]pyame, [icang ba]ContrFoc mehang iya nasi. 
if about Av.eat 1SG PART AV.give 3sG rice 


‘As for eating needs, I am the person who gave him meals: 


Additional details for marked discourse functions in left periphery positions will be pro- 
vided in $6. There can also be a right-dislocated position in Sembiran Balinese. This is 
the case for a re-introduced or after-thought topic, which is discussed in $5.3. 


3 Information structure: an overview 
An information structure (i-str) is a structure by which meanings are packaged to accom- 


modate speaker-hearer needs for effective communication in a given discourse context 
(cf. Lambrecht 1994: 5; Vallduví & Engdahl 1996: 460; among others). Examples of the 
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units of i-str that are widely discussed in the literature are different types of topic and 
focus. Topic and focus are expressed by different formal mechanisms in the grammar of 
a given language, depending on the available morphosyntactic, prosodic and/or lexical 
resources. In Sembiran Balinese, voice system, structural positions, nominal expressions 
and prosody are important resources for i-str.* 

The precise mechanism that underpins the various ways in which information is pack- 
aged within and across languages has been subject to intense study (Vallduví & Engdahl 
1996; Erteschick-Shir 2007; Dalrymple & Nikolaeva 2011; among others). For the analysis 
of i-str in Sembiran Balinese in this paper, a parallel model with a LFG-like framework 
was used, which separates different layers of structures to distinguish predicate argu- 
ment structure from linear order constituent structure (c-str), surface grammatical rela- 
tions and information structure. The grammatical relations are represented using labels 
commonly used by typologists, such as A and P, as in (3). It was also recognised that the 
pivot is part of the surface grammatical relations. 

In the adopted framework, there is no one-to-one relation between these layers of 
structures. Thus, different sentences in (10) are driven by different forces in information 
structure. They all have the same (underlying) predicate argument structure, but their 
argument roles are mapped onto different surface grammatical relations and different 
discourse functions. For example, the Actor John in (10) is the grammatical subject-topic 
in (10a), but it is a contrastive focus (while still being a pivot) in (10b) and a completive 
focus and grammatically oblique in (10c). Explicit information about i-str is given as 
necessary, and this is represented by means of annotations, e.g. as in (11) for sentence 


(10b). 


(10) a. John killed the robber. 
b. It’s John who killed the robber. 
c. The robber was killed by John. 


(11) It is [John] 4:contrFoc [who killed the robber |vp.Comment|Given- 


The information structure itself could be considered to consist of different layers with 
different possible associations of clausal constituents. For example, the i-str system at the 
broadest level may operate with two layers showing topic-comment and given (presup- 
posed)-new focus distinctions as seen in Russian, where the topic precedes the comment 
and the given precedes the focus (Foley 2007; Comrie 1987: 405). Thus, the focus expres- 
sion always comes later, and not all of the units of the comment belong to the focus. 
Consider the mini dialogue in (12) from Russian (13). 


(12) S: Maksím ubivájet Aleksé ra (Russian) 
Maxim.NoM kills — Alex-Acc 


“Maxim kills Alexei’ 


^Prosody is not discussed in this paper. While we are aware of the role of prosody in information structure 
in (Sembiran) Balinese, we have not conducted specific research on this topic. This is one of the areas that 
needs further research not only in Sembiran Balinese but also in Plains Balinese. 


145 


I Wayan Arka & I Nyoman Sedeng 


Q: A Viktor-a? 
and Victor-Acc 
“And Victor?’ 
A: Viktor-a Maksim zasciscajet. 
Victor-Acc Maxim.Nom defends 
‘Maxim defends Victor’ (Comrie 1987: 96) 


(13) . Víktora Maksim zaséi8éajet 
[- topic -] [+ - - - comment - - - -] 
[----- given - --- - ] [- focus -] 


The answer in (12.A) has the same information structure represented in (13), where Viktor 
is the topic and part of the comment (i.e. Maxim) is given. Additional complications in 
Balinese Sembiran will be discussed in §5, where part of the comment is fronted and 
gains a contrastive focus. 

Building on earlier studies on information structure (Vallduvi & Engdahl 1996; Erte- 
schick-Shir 2007 2007; Dalrymple & Nikolaeva 2011; Krifka & Musan 2012; among others), 
todefined as a prototypical unmarkedpic and focus were conceptualised as two broad 
categories forming the information structure space where pragmatic and semantic no- 
tions of contrast, salience and givenness are essential. Following Choi (1999: 133), topic 
and focus were analysed as non-primitive notions. It is proposed that they are decompos- 
able into features that capture the three independent but intertwined cognitive-discourse 
properties just mentioned: contrast, salience and givenness. 

The features [salience] and [givenness] are typically topic-related. They encompass im- 
portant semantic-pragmatic properties in communicative events, such as the particular 
frame/entity within/about which new information should be understood (i.e. the “about- 
ness" of the topic), and the degree of importance/salience/prominence of one piece of 
information relative to other bits of information in a given context. The latter is related 
to the “emphatic” element of communication. It reflects the speaker's subjective choice 
of highlighting one element and making it stand out for communicative purposes. 

While often closely linked, salience and givenness are distinct. The two do not always 
go together. New information (i.e. [-given]), for example, can be [+salient]. This is a 
situation in which indefinite/generic referents are assigned emphatic focus, which is 
further discussed in 84.2. 

The feature [contrast] captures the explicit choice of one alternative with the strong 
exclusion of the others in a given contrast set. It can be associated with both topic and 
focus, which is further discussed in 85.5 and 84.2. 

The three features with their values result in eight possible combinations in the 
i-str space, as shown in (14). The features can be used to characterise fine-grained 
(sub)categories of topic and focus and to explore how they interact. 


(14) Grammatical functions: default generalised semantic relations 


a. [+salient, «given, +contrast] = contrastive frame/TOP 
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b. [+salient, -given, +contrast] = contrastive (often fronted) FOC 

c. [+salient, -given, -contrast] = new (i.e. first mentioned indefinite) TOP 
d. [+salient, +given, -contrast] = default/continuing/reintroduced TOP 

e. [-salient, «given, -contrast] = secondary TOP 

f. [-salient, -given, -contrast] = new (completive/gap) FOC 

g. [-salient, -given, +contrast] = contrastive new FOC 

h. [-salient, +given, +contrast] = contrastive secondary TOP 


To clarify the complexity of information structure involved in Sembiran Balinese, the 
conception of common ground (CG) was adopted (Krifka & Musan 2012, and the ref- 
erences therein). Two related aspects of CG should be distinguished: the CG contents 
and CG management. The CG contents refer to the set of information mutually shared 
by speech participants in a given context. This information can be a set of presupposed 
propositions in the immediate/current CG and a set of entities introduced earlier in the 
discourse or general shared information. 

CG is dynamic. It is continuously modified and adapted for communicative purposes 
throughout a speech event, e.g. by the addition of new information to the CG contents. 
The speaker generally has control over how to proceed in a speech event depending on 
his/her communicative interests/goals, possibly also considering the addressee's inter- 
ests/goals. The way the communicative moves are handled to update and develop the 
CG is part of CG management. 

CG management reflects the speaker's perspective and attention characterised by the 
properties shown in (14), e.g. what is assumed/presupposed, singled out and contrasted, 
emphasised or new in a given communicative episode. Thus, in example (10), where the 
A and P referents ‘John’ and ‘the robber’ are both [+given] (i.e. already shared in the CG), 
the speaker has more than one option to restructure the information depending on his/ 
her interest or attention for effective communication. The use of the passive structure 
(10c), for instance, reflects the choice that P robber is of interest and is considered salient 
about which the new information, the event of killing, should be understood. 

In the following sections, the interactions among the properties shown in (14) in Sem- 
biran Balinese are illustrated in more detail. 


4 Focus 


Focus has been characterised in the literature in terms of two properties: informational 
newness and the presence of alternatives. In terms of the features outlined in (14), the 
first property is captured by the feature [-given]. That is, the focus is the informative, 
new and non-presupposed part of the proposition (Lambrecht 1994; Vallduví & Engdahl 
1996; Dalrymple & Nikolaeva 2014; among others). It is the information added to the 
CG by (part of) comment expressions in statements, question-answers in dialogues or 
actions required in commands. 
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Focus can also be characterised in terms of the presence of alternatives (Krifka 2008; 
Krifka & Musan 2012). This is particularly clear in the case of the contrastive focus and 
the contrastive topic, which embeds the focus, as discussed in the previous section. The 
concept of "alternative" is part of the conception of the set, and according to Krifka, the 
presence of alternatives is in fact central to the definition of focus. There is a presence 
of strong alternatives with an overt focus marking in Sembiran Balinese that carry the 
[*contrast] feature. This is to distinguish it from an unmarked new focus discussed in 
this section, which is [-contrast], i.e. carrying no overt contrast in the expressions and 
no (clear) contrastive sets in the current CG other than alternatives due to a general 
knowledge of things in the world. In $4.1, this general local knowledge in Sembiran 
Balinese is exemplified to illustrate the point that focus indeed shows the presence of 
potential alternatives in a given shared local socio-cultural setting. The understanding of 
the choice of one alternative instead of another in a set is implicit and therefore requires 
a good understanding of broader information in the CG. It is argued that a focus related 
to this type of alternative has a weak or implicit 'contrast-like' meaning and is therefore 
categorised as having a [-contrast]. New focus in Sembiran Balinese is discussed in 84.1, 
followed by the contrastive and emphatic focusses in $4.2. 


4.1 New focus 


New/completive focus is defined as a prototypical unmarked focus ([-salient, given, 
-contrast]): it has negative values for the relevant features and contains no strong or 
embedded contrastive element. It is [-given], meaning that the information is not part 
of the CG (e.g. being asked) or is newly added to the current CG either by the speaker 
or the addressee as the speech event progresses. 

New focus is unmarked in the sense that its expression is assigned no specific tagging 
to signal a contrast or any other salience, such as structural fronting and/or the use of 
emphatic markers. New focus is therefore [-contrast, -salient]. An expression with a 
new focus is a constituent unit that appears in its canonical position. While implying 
the presence of alternatives (due to general world knowledge), new focus was analysed 
as [-contrast], as there is no entity present in the current CG with which it is being 
contrasted. 

The clearest instance of a new focus that shows non-presupposed information with 
the presence of alternatives is related to question-answer pairs. This is exemplified by 
polarity interrogatives, as in example (15). Polarity interrogatives, as the name suggests, 
are associated with either ‘yes’ or ‘no’ answers. This type of question clearly illustrates 
the presence of alternatives in a new focus. In (15), ngara ‘no’, instead of the other alter- 
native ae ‘yes’, is the information being asked; hence, it is new. 


(15) Context: Men Dora told a story about how she gave her money to somebody but 
did not get her land certificate, and consequently, she lost her land. 
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Question: Bakat tanah-e? 
uv.obtain land-DEF 


‘Did you get the land?’ 
Men Dora: [Ngara]goc, [ngara] fo. bakat. 
NEG NEG Uv.obtain 


‘No, (I) didn't get (it): 


A new focus in relation to content questions such as nyén ‘who’ and apa ‘what’ also 
implies the presence of alternatives. Thus, the interrogative in (16a) can be analysed as 
having an information structure with a pragmatic presupposition, as shown in (16b) in 
the CG. The shared presupposition is that every traditional garden definitely contains 
certain plants such as oranges, coconuts, and mangoes. The alternatives within this pre- 
supposed set of plants are part of the general local semantic field or knowledge in CG. 
The entity questioned and the answer given (i.e. X - focus) are among the alternatives in 
the set classified as PLANTS commonly cultivated in the garden, which is poh *mango' 
in this case. 


(16) a. Miasa: Tanah-anne Patra m-isi apa dowang? 
land-3sG.Poss Patra Av-contain what PART 


"What is Patra's land planted with?' 
Dora: [Poh cenik-cenik]roc. Mara jani m-isi poh. 
mango small-RED new now AV-content mango 


"Young mango trees. It has just been planted with mangos’ 


b. Presupposition: Patra's land is planted with X, where X € PLANTS 
COMMONLY CULTIVATED IN THE GARDEN 
Question: What is X? 
Answer: X = ‘mango’ (one of the plants commonly cultivated in the garden) 


A new focus in monologue types of genres, such as narratives and descriptions, may 
also imply alternatives. The speaker in this type of genre, being the sole participant re- 
sponsible for additional new information to update the CG, often provides new informa- 
tion piece by piece for easy and comprehensible communication with his/her addressee. 
Crucially, new focus expressions often come with modifiers of some type, flagging one 
piece of information in the current CG signalling one alternative, possibly in anticipa- 
tion of more (alternative) information later in the discourse. For example, the speaker 
(Men Dora) tells her story about herself and discusses her children. In the first sentence 
in her autobiography (17a), the modifier mara besik, ‘still one’ signifies that panak ‘child’ 
(focus) is just one of the set of children that she will discuss. Later in her story, she dis- 
cusses other children, including sentence (17b) about her second child. Here, the adverb 
buwin is used to signify similar new information (‘giving birth’, ‘baby[-boy]’). Note that 
focus expressions include one of the alternatives that is true, e.g. buwin ninnya ‘male 
again’ is used in (17b) instead of the other alternative ‘baby-girl’ in accordance with the 
truth condition of the (updated) CG contents. 
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(17) a. [Ngelah [panak]p]Empn.roc [meme], Top [mara besik]p NewFoc madan 
Av.have child mother still one MID.name 
Butuh Dora. 
Butuh Dora 


‘I (mother) gave birth to the first (lit. still one) child, named Butuh Dora’ 
b. Ba kento [@]ATop buwin sa ngelah [panak]foc; [Ø]s Top buwin 
after like.that again PART AV.have child again 


[ninnya]roc. 
male 


“After that, I again gave birth to a child, (and) (he's) again a baby boy: 


In terms of its structural expression, a new focus is typically part of the comment con- 
stituent and distinct from the topic; however, the comment constituent may be split, as 
in (17a) where the comment VP is fronted, leaving the numeral phrase modifier in the 
object position. The whole predicate in sentence (17a) (i.e. “giving birth to one child’) 
is actually new in the discourse. The split with fronting the VP can be considered the 
speaker's way of assigning some type of emphatic prominence to her phases of moth- 
erhood with a series of childbirths. At this point of the story, it is about the first baby. 
Based on the characterisation of the new focus adopted in this paper, mara besik 'first 
child' is a proper new focus in (17a). The fronted VP, which gains emphatic meaning by 
fronting, can be precisely labelled as the emphatic new focus. It is a marked focus, which 
is discussed in next section. 


4.2 Marked focus: Contrastive and emphatic 


A "marked focus" refers to a focus whose information structure contents in the CG are 
complex, typically characterised by [+contrast, +salient, -given]. The contrastive mean- 
ing ranges from a strong one to a subtle (emphatic) one, with complex encoding at the 
formal expression level. The complexity can be structural, involving the use of an ex- 
tended clause structure with unit fronting to the left-periphery. It may also be accompa- 
nied by specific focus markers. At the content level, the complexity is indicated by the 
presence of an embedded element of a contrast set. 

The presence of a contrast set constrains the contextual interpretation of the focussed 
element. In Sembiran Balinese, it may range from a focus with a strong contrastive mean- 
ing (i.e. the choice of one with a clear exclusion of the other alternative(s) in the under- 
stood set) to a focus with an emphatic meaning. Emphatic focus, as the term suggests, 
encodes a class of salient pragmatic nuances, such as emphasis, affirmation and counter- 
expectation, which the speaker wants the addressee to pay attention to during commu- 
nication. 

There is no clearly defined difference between contrastive and emphatic meanings, 
as both are associated with the contextual presence of contrasting alternatives. For sim- 
plicity, they are discussed under the broad category of contrastive focus. In Sembiran 
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Balinese, they both make use of the same linguistic resources. The difference, if any, ap- 
pears to be a matter of degree, involving how explicit the contrasting entities are present 
in the contrast set and how strong other pragmatic nuances, such as emphasis and affir- 
mation, are expressed in a given context. The degree of the strengths of these nuances 
in Sembiran Balinese can often be seen from the extent of marking present, e.g. whether 
fronting is also accompanied by an overt focus marker. Cases with strong contrast sets 
are discussed first, followed by cases with subtler emphatic nuances. 

Cases for clear contrastive focus can be informally represented, as in (18). The repre- 
sentation shows that X is the contrastive focus if it is the selected member of a contrast 
set in the current CG, with the other contrasting entity, Y, excluded from the set. The 
presence of a contrast set is often strong and expressed by some type of structural and/ 
or particle marking. This is exemplified by the question-answer pair from English shown 
in (19). The question (Q) sets the contrast between two alternatives, which are overtly 
marked by the disjunction or in the question. The answer (A) in (19) selects one (‘the 
white’) and excludes the other. The contrastive focus of the answer (19A) can be repre- 
sented as in (20). Note that what is new in the answer is not the two entities in the set 
(as they are both present in the CG), rather, it is the selection of one of them. 


(18) | Contrastive focus X: ([X]contrFoc; [Y] ---}Foc 
[where the contrasting entity, Y, is clearly 
established in the immediate/current CG.] 


(19) Q: Which laundry did John wash, the white or the coloured? 
A: He washed the WHITE laundry. (Erteschick-Shir 2007: 48) 


(20) {[‘the white’ ]contrFoc; [ the coloured’ ]}foc 


In Sembiran Balinese, strategies used to express contrast include the following: fronting 
to the left-periphery position, structural parallelism, lexical items (e.g. antonymous 
words), polarity particles, focus markers and prosodic prominence (i.e. stress). Fronting 
is the most common strategy, which is often combined with one or more of the other 
strategies. 

Consider the context of contrastive focus in the second clause in (21). The presence 
of polarity ngara in the first sentence sets one (negative) option in the bipolar contrast 
set (X in the representation in (22)). The second clause adds new specific information 
to this negative option by stating that the money was actually corrupted (lit. ‘taken 
and eaten’). Crucially, the speaker provides extra emphasis on this by preposing the VP 
clause-initially. It therefore bears a contrastive focus, as it indicates that the speaker 
strongly highlights the negative option, excluding the positive option (Y). 


(21) Context: The secretary officer in the village was trusted to collect the money 
needed to cover the costs for the issuance of the land certificates for a group of 
people in the village, including the speaker; however, the money was corrupted 
by the secretary and the village head. 
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Tau-tau [pipis-e]P Top [ngara setor=a ` ke kantor]goc, [[juwang=a amah=a di 


know-RED money-DEF NEG transfer=3 to office uv.take=3 uv.eat=3 on 
jalan] conrrFoc|NewFoc [ento]p piv Top [ajak-a wakil prebekel-e]Nswroc- 
road that accompany=3 deputy head.villate-DEF 


‘Surprisingly, the money was [not transferred to the (Land) office lo, that 
(money) was [taken and eaten along the way]contrFoc by him and the village 
head: 


(22) {[X: ‘not transferred /taken and eaten’ ]ContrFoc, [Y: ‘transferred’ ] }NewFoc 


The example in (23) further illustrates contrastive focus, achieved by using passivisation, 
whereby the new information is made the subject. Linking an argument with new infor- 
mation (i.e. new focus) to the pivot is rather unusual as far as information structure are 
concerned; however this is done for a good communicative purpose, namely to achieve 
an element of surprise associated with the new information. That is, the focus item needs 
to be fronted to gain the contrast meaning. Note that sentence (23) is a reported speech 
where the first clause is the reported question asking for water. The second clause is the 
reported answer. The entity ‘blood’ is the new focus, as it is the answer to the question. It 
is also contrastive because it is being contrasted with the expected answer (‘water’). This 
is a folktale, a work of fiction with a giant as the main character. It is full of surprises, 
e.g. the giant eats human beings and drinks human blood. 


(23) a. Context: a story about a girl called bawang, who is asked to cook by a giant 
who eats humans. 
Mara iya nakon-ang [yéh]p Foc, [getih kanya]p Top tuduh-ang=a. 
just.time 3sG Av.ask-APPL water, blood PART UV.point-APPL-3 
"When she asked for water, it was blood that he pointed at’ 


b. {[blood’]contrFoc, ['water(asked)' ]]NewFocus 


In this example, the contrast set is clearly established through (reported) question and an- 
swer pairing. However, in other cases, the elements in the contrast set might be fully un- 
derstood only in relation to a complex locally/culturally specific CG, not simply the truth 
condition involved. For example, the element of [+contrast] associated with a counter 
expectation in the following sentence can be fully understood only in the local cultural 
setting as described in (24): 


(24) Socio-cultural context: the speaker is a poor woman involved in the so-called 
ngadas practice: she was given a female piglet to look after as capital by 
somebody else. The agreement was that when the pig grew and became a 
mother-pig with its own piglets, the speaker should pay the owner back using 


the offspring. 
[Ba ada [ukuran lima bulan]wewgoc [ubuwin]]Comment [kucit-e]p-Top, [mara 
already exist about five month uv.look.after piglet-DEF “just 
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ukuran setengah}contrFoc Lëede-p ` kucit-e]s-rop, [mati]contrFoc [kucit-e]s ton, 
about half size-NML piglet-pEF die piglet-DEF 


"Presumably, for about five months, I looked after the pig. The size of the pig's 
body was about half the size of a mature one's, (but) it died unexpectedly: 


Note that the contrastive focus of the last sentence in (24) arises from the fronting of the 
predicate mati ‘died’. It is also marked by a prominent prosodic stress, resulting in the 
speaker's subtle complex meaning of ‘surprise, unexpectedness, unwantedness’. This is 
understood in the socio-cultural context described in (24), where the piglet is not hers but 
a type of loan capital. The contrastive focus also expresses the speaker's strong feelings 
of disappointment in contrast to her expectation that it would grow and eventually give 
birth to offspring. The piglet's death was premature (at around five months of age). It 
was still relatively small, at half the size of a full-grown pig. 

The contrast element in the focus can often be augmented by the use of focus mark- 
ers in addition to constituent fronting. Here, the use of focus particles ba and jeg are 
exemplified in Sembiran Balinese. Sentence (25) illustrates the use of ba. This particle ap- 
pears to have originated from suba, the adverb/auxiliary meaning 'already/perfective', 
which is also often abbreviated as ba.? Both appear in example (25). The free translation 
is given here to show the emphatic focus involved, namely the long-awaited and good 
news about the completion of the making of the shirt material. The focus particle ba 
carries a sense of relief, or of no more thinking/concern on the part of the speaker. Note 
that it would take days, or even weeks, to complete the weaving. This again points to 
the fact that the emphatic focus has a subtle meaning that would only be understood in 
a given local cultural setting. 


(25) Context, barter-based economy: the speaker promised to give the person 
(addressed below as Nang) a hand-woven material to create a shirt in return for 
six-hundred ears of corn that the person had previously given her. 

Nang, [ba tepud ba}contrFoc [lakar ` baju-ne]s oy. 
father PERF done PART material shirt-DEF 


“Father, it's done, the material for the shirt? 


The focus particle jeg carries a selection of one option instead of the other(s) with neg- 
ative nuances, such as something unwanted or no other alternative. This is often asso- 
ciated with an event/action carried out against the speaker's/addressee's wishes. This is 
exemplified in (26) and (27). In (26), the focus marker jeg appears with the negative sen- 
tence, highlighting the absence of any possession whatsoever on the part of the speaker. 
In (27), it is an imperative sentence, and the instruction to the addressee is that he must 
find a doctor, nobody else (e.g. not a shaman), for the mother. Note that in this second 
sentence, the focus is doubly marked by jeg and ba. 


?It also has a prepositional-like meaning ‘after’ as in (su)ba kento ‘after (like-)that’. The grammaticalisation 
of forms with these meanings has been reported in other languages (cf. Heine & Kuteva 2002: 17, 134). 

$Nang is the vocative use of nanang ‘father’. This kin term is used to address the brother of one’s father or 
any male of the same age as one’s father. 
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(26) Context: The speaker had a hard life with many children to raise. This sentence is 
part of the most difficult time in her life when one of her children was seriously 
ill: 
Kene ojog=a masan parah=sen jeg apa ngara ngelah. 
like.this visit=3 time  hard-very PART what NEG au have 


"Ihe worst time hit hard at this time, I had NOTHING whatsoever: 


(27) Tut Sik, meme jeg dokter bo alih-ang di Jakula nto. 
Tut Sik mother PART doctor PART UV.find-APPL PREP place that 


‘Tut Gasik, as for me (Mother), you just find a DOCTOR (nobody else) for me in 
Tejakula’ 


The example in (28) illustrates an emphatic focus. It is also achieved by predicate fronting. 
The speaker describes her husband by placing emphasis on his negative characteristic of 
being lazy in contrast to an otherwise more positive alternative commonly expected for 
a good husband. This characterisation of laziness is the first mention of this in the text; 
hence, it is an emphatic new focus. The contrastive element has no overt expression in 
the preceding context. It should be understood based on the good values assumed in the 
community as part of the general CG. 


(28) [Gelema kalud ng-luyur ngara nyak pati me-gaé]prea ContrFoc: [sommah 
person PART Av-wander NEG want ever Av-work husband 
meme-né ` ento]sToy. 
mother-POSS DEM 


‘A person who's wandering around, not wanting to work, my husband is’ 


5 Topic 


Topic is defined prototypically in terms of the file-card metaphor (Reinhart 1981; 
Erteschick-Shir 2007; Krifka & Musan 2012) in relation to the comment part of a sen- 
tence and the i-str features given in (14). The following is the definition of topic, adapted 
from Krifka & Musan (2012: 28): 


(29) Definition of topic: 
The prototypical topic constituent of a clause is the one referring to a [+salient] en- 
tity in the CG, under which the information of the comment constituent is stored 


or added. 


As mentioned, the [+salient] feature is meant to capture the most important cognitive 
property of an entity (or a set of entities) in a given context about which attention and 
additional information is given to increase the addressee's knowledge (in statement), is 
requested from the addressee (in question), or when an action is requested (in command). 
This definition is consistent with the traditional concept of “aboutness” topic (Reinhart 
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1981; Gundel 1988; Lambrecht 1994: 210, among others) and “attention” (Erteschick-Shir 
2007: 44). 

The concept of prototype (Rosch 1978; Taylor 1991) was used in the definition in (29) 
to capture different types of topics, particularly because there may be a less canonical 
topic, called a secondary topic. While its referent is present in the CG, this topic is not 
as salient as the default topic, which now becomes the primary Toric.’ The secondary 
topic gains its salience in relation to the primary topic; see further discussions in $5.4. 

Next, the most common type of topic is discussed first, namely the default topic of a 
clause, which is also grammatically a pivot. 


5.1 Default topic 


The term DEFAULT TOPIC is used to refer to the only topic in the basic (i.e. non-extended) 
clause structure characterised by [+salient, +given, -contrast] properties. Its referent has 
been established and shared in the CG (i.e. cognitively/pragmatically salient and given). 
Crucially, it is not contrasted. That is, as far as CG management is concerned, it is a unit 
without an embedded element of contrast. Grammatically, it is the pivot of the clause, 
occupying a unique pivot position in the clause structure. As mentioned in 82, the pivot 
selection and therefore default topic selection is signalled by verbal voice morphology. 

The material realisation of the default topic varies for discourse-pragmatic reasons. It 
can be an overt noun (NP), a free overt pronoun or a zero pronoun. The data suggests 
that this is determined by the activation and relative adjacency of the relevant entity in 
the current CG. An overt common noun topic is typically a definite topic, possibly a re- 
introduced default topic; a pronoun or a zero pronoun is typically a continuing default 
topic. Each is discussed and exemplified in the next sections. 


5.1.1 Common nouns and proper names 


Common noun and proper name topics constitute only 18% of the total default topics 
in Sembiran Balinese. The majority of default topics are pronominals, unexpressed/zero 
subjects (67%) and overt pronominal subjects (15%).Š Default topics are typically definite, 
i.e. having a [+given] property. Nouns gain definiteness in different ways. The most com- 
mon way in text is a second (or later) mention after it is introduced as the new focus in 
a previous sentence. For example, in the following excerpt, the noun jagung ‘corn’ is 
introduced in the first sentence and becomes a definite topic later, flagged by a definite 
marking (-e, ento). Likewise, the NP lakar baju 'shirt material' becomes the topic after 
the second mention, referred to by the definite determiner ento. 


7The term primary TOPIC is the default topic in the presence of the secondary topic. They refer to the same 
kind of topic and are used interchangeably in this paper. 

*'These statistics are based on a limited text corpus of 66,677 words, consisting of traditional folktales and 
a recording of the personal story of Men Dora. The recording was first transcribed in ELAN, and then the 
appropriate tagging reflecting grammatical relations and information structure status was done in ELAN, 
before a simple statistical calculation was undertaken. 


155 


I Wayan Arka & I Nyoman Sedeng 


(30) Ba kento, behang-a meme jagung tigang atak, nunun ntas meme 
after that uv.give-3 mother corn three two.hundred Av.waive then mother 
lakar baju, ba  tepud ento ...ba kento, jagung-e ento ada a=bulan 
material shirt PERF finish that PERF that corn-pEF that exist one=month 
tengah dahar... 
half uv.eat 


“After that, I (mother) was given 600 ears of corns...I then waived shirt material; 
it was then done...after that, the 600 ears of corn were consumed in about one 
month and a half... 


A common noun can gain its topicality (i.e. [+given]) even when first mentioned in the 
discourse through a vocative use. It exophorically refers to the speaker or the addressee 
in a given context. The noun types that possibly function in this way are typically kin- 
term nouns. For example, the default topic in (31) is meme ‘mother’, used vocatively to 
refer to the speaker. This is the first sentence in the autobiography text. It is a topic NP 
because it is the entity/referent (i.e. the speaker, Men Dora) about whom 'giving birth to 


the first child' is being told. 


(31) Context: the speaker Men Dora told her son (i.e. the addressee) about her life. 
Ngelah panak [meme]a Top mara besik, madan Butuh Dora. 
Av.have child.p mother still one MID.name B. Dora 


‘I (mother) gave birth to my first child, named Butuh Dora’ 


A common noun can also gain its [+given] property through a possessive relation with 
the addressee. In example (32), the NP nanang caine, ‘your father’ is the default topic of 
the second sentence. It has not been mentioned in previous sentences. Its referent is part 
of the shared CG information, as it is the father of the addressee who is also the husband 
of the speaker. 


(32) Context: the speaker told her son (i.e. the addressee) about his father. 
Buina ngelah panak mara patpat kento teh, [nanang]; p». xo» cahi-ne 
moreover AN have child.P still four  like.that PART father 2SG-DEF 
ngalih somah buwin. 
AV.take wife.p again 
‘In addition, when I had given birth to four children, your father took another 
wife. 


5.1.2 Pronominal topic: Reintroduced and continuing 


When a referent is highly active in the CG and established as the default topic, there is 
often no need to express it overtly; however, if expressed overtly, it is often realised as 
a pronoun, as in (33a). In this example, the default topic tiyang? (the speaker Men Dora) 


°Note that tiyang is a Plains Balinese pronoun (h.r.). Speakers of Sembiran Balinese are also typically fluent 
in Plains Balinese as well, and code switching is common. 
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is already salient in the general CG. In the two sentences immediately preceding it, the 
pronounisnot the topic. It is a topic in an earlier sentence. In this case, it can be classified 
as a reintroduced topic (see $5.3); however, in the sentences immediately following (33a), 
tiyang (or the speaker, index i) is maintained as the default topic. It is realised as a zero 
pronoun, represented as Ø. The default topic is a continuing topic in these instances. In 
short, a continuing topic is, like a reintroduced topic, a discourse-level topic, but the 
referent of a continuing topic is already present in the immediately preceding sentence. 


(33) Context: the TOP in the two immediately previous sentences is about the 
speaker's two last children out of nine children: 


a. Ba keto mara [tiyang], piv'Top ingelah panak siya. 
after that just 1sc Av.have child nine. 


"Ihen, after, I gave birth to a total of nine children: 


b. [2] A Piv.Top Ba nau ne ba. 


i PERF happy this PERF 
‘I was already happy: 
C. [@]Apivrop Ba lupa [O]apivrop ba ngucapang ... 
i PERF forget i PERF AV.mention 


‘I forgot to mention (something): 


These data from Sembiran Balinese represent a common pattern where an entity that is 
highly salient in a series of immediate states of CG is selected as the continuing topic. It 
is typically formally reduced in its expressions, either as a zero pronoun (67%) or a (clitic) 
pronoun (15%). This fits well with Givón's (1990: 917) observation that referents that are 
already active in the CG require minimal coding. This is also consistent with the findings 
in Plains Balinese (Pastika 2006) and in other Austronesian languages of Indonesia with 
verbal voice morphology, such as Pendau (Quick 2005); however, in other Austronesian 
languages with diminishing verbal voice morphology, e.g. in certain dialects of Sasak, 
the use of pronominal clitics is widespread, and the discourse distribution of nominal 
and voice types is expected to be different (cf. Wouk 1999). 


5.2 New topic 


While topic is typically [+given], it can be [-given]. This is the topic whose referent 
is firstly mentioned in the discourse (i.e. newly introduced in the CG). It is typically 
introduced by the verb ada in Sembiran Balinese. Consider (34), which is the first line 
of a story. The subject of the first clause is an indefinite NP tuturan satua. This is a new 
topic. Then, the second clause provides more specific information about the new topic. 


(34) Ada [tutur-an satua]Newrop. madan I Bapa Sedok. 
exist tell-NML story MID.call ART Father Sedok 


‘There is a story often talked about called I Bapa Sedok’ 
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The sentence in (35) is in the middle of a story, but it is the first time the referent nak 
‘person’ is introduced in the context described in (35). While indefinite and new, nak is 
the topic here, as the information that follows is about this NP, nak. 


(35) Context: The sentences are about the speaker's bad experience. She was deceived 
by the village official and lost her money in the process of the issuance of land 
certificates. Somebody else, called Sumarwi, stepped in to replace her money: 
Ada nak | nimbalin, Sumarwi adan-anne neh nimbalin. 
exist person Av.replace Sumarwi name-Poss REL AV.replace 


"Ihere's a person reimbursing (the money); Sumarwi is the name of the person 
who reimbursed the money: 


5.3 Reintroduced topic 


The term REINTRODUCED TOPIC is used to refer to a topic expression associated with a 
salient entity already selected as a topic earlier but that is picked up again as a topic 
in a clause (cf. Givón 1990: 760); hence [+salient, + given]. It is not associated with a 
contrastive set in the CG, however. The reintroduced topic has been exemplified in (33). 
In this example, the pronoun exophorically refers to the speaker, so there is no ambiguity 
issue in its identification. 

When there is more than one entity in the CG that the third-person pronoun can refer 
to, as the default topic, a pronoun may need further specific information provided by a 
full NP expression. In Sembiran Balinese, this full topic NP may come later in the clause 
in the right dislocated position. This is exemplified in (36). The pronoun iya in the last 
sentence is potentially ambiguous, as there are other participants in the CG indicated by 
the indices i and j. To avoid ambiguity, the speaker provides additional information: “that 
(first) co-wife’ (underlined) in the right detached position, index k. Thus, there are two 
topic expressions referring to the same entity in this sentence; iya is the default topic 
expression, and the full NP madu-né ento is the reintroduced topic. 


(36) Context: the speaker is the second of three co-wives reporting what the first 
co-wife (index k) has said about Sapin, the third co-wife (index i). 
Tawah [I Sapin-e ento] ;. [[panak[-a] ; ] j ngara gaen-ang[-a]a i banten 


strange ART Sapin-prr that child-3 NEG make-APPL-3 ritual 
behan Øs i ngara ngelah, orahhang[-a]a ; aget-se .-kento ` [iya]sTop A 
because NEG Av.have say-3 lucky-PAnT  like.that 3sc 
m-peta, [madu-n-né ento) Top k- 


Av-mention co-wife-LIG-3POSS DET 


‘Sapin į was strange. [Her ; child] j. She ; didn't make any ritual due to her j 
lack of money (when the child unexpectedly died prematurely), and she , said 
she was lucky’, she k said, the (first) co-wife p. 
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5.4 Secondary topic 


The SECONDARY TOPIC is defined in relation to the default or primary topic. It is defined 
as ‘an entity such that the utterance is construed to be ABovT the relationship between 
it and the primary topic (Dalrymple & Nikolaeva 2014: 55). The secondary topic is like 
the primary topic in that it is pragmatically [+given]: It is present in the (immediate) CG; 
however, it is less salient than the primary topic. Saliency reflects some type of promi- 
nence, which can be assessed based on certain properties related to how it is marked 
in a given language, e.g. linear order (with the earlier sentence-initial position being 
more salient than later), linking (with the subject/pivot topic being more prominent than 
the object topic) and explicit marking (with the focus marked by the contrastive focus 
marker, which is more prominent than the [unmarked] new focus). 

In the English example from Lambrecht (1994: 148), sentence (37) contains two topic ex- 
pressions, both expressed by the pronouns he and her. Their referents are already present 
and salient in the CG due to the preceding (37) and (37) sentences. Sentence (37) is about 
John, referred to by the subject he; hence, this is the primary topic. The secondary topic, 
the object her, is part of the comment constituent. The communicative intent of (37) — its 
new information, the focus constituent of the Comment - is the assertion of the “love- 
relation" in which Rosa was not loved by John. 


(37) a. Whatever became of John? 


b. He married Rosa, 


c. but [he]Primaryrop [[didn't really love]foc [her |secondaryToplComment 


The secondary topic, like the primary topic, can also be a continuing topic when its 
referent is already salient and present in the general CG. It is often the case that two 
salient entities in the CG alternate between the primary and secondary topic, depending 
on the focussed predicate involved. Consider the following excerpt from the text in (38), 
which comes after example (32). The speaker repeats the same message with some new 
information about her status as one of three co-wives of her husband. Both the speaker 
and her husband are highly salient in the immediate CG. The speaker (index i) is the 
continuing topic in the three sentences, becoming the primary topic in (38b) and (38c). 
Her husband, realised as the clitic =a (index j), is also the continuing but secondary 
topic in (38b) and (38c). Note that in (38b), the husband becomes the primary topic of 
the adverbial clause, realised as a zero pronoun. 


(38) a. Ba kento mara [meme]Aoy i ngelah panak patpat, 
after that just.after mother Av.have child four 


‘Then, after, I (mother) gave birth to four children; 


b. ØPTopı i Kalahin[-a]A Top2_j [Ø p-Top1 j ngallih somah buwin]. 
vv.left-3 AV.take wife again 


‘I was left by him to take a new wife again’ 
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c. Madu-telu-ang[=a] A Top2_j [meme]p-top1- 
co-wife-CAUs=3 mother 


‘I was made one of his three wives? 


In Sembiran Balinese, instances of the secondary topic are typically the A of the UV 
verbs expressed as (clitic) pronouns. In this case, P is the primary topic, also highly topi- 
cal and selected as the pivot. Sembiran Balinese is like Plain Balinese, in that in both UV 
and AV clauses, the A argument is highly topical and even more topical than the U argu- 
ment. Pastika (2006) presented statistical evidence from a referential distance measure 
(cf. Givón 1994), which showed that the significant factor for the selection of voice type, 
AV vs. UV, in Balinese is the topicality of U rather than that of A. 


5.5 Contrastive topic 


The contrastive topic expression is defined as being associated with [+salient, +given, 
*contrast] features. That is, like the types of topic discussed thus far, it refers to a referent 
already present in the CG and is highly salient (e.g. about which comment information 
is added); however, it differs in that it carries an element of contrast (i.e. [+contrast]). On 
the expression side, the [+contrast] feature has an explicit marking of some type. On the 
CG side, it is associated with an established contrast set of referents. 

A contrast is marked in different ways. In the English question-answer example in 
(39), the contrast set is restricted and established by the nominal siblings in question 
(A) and also by structural parallelism through the coordination accompanied by a par- 
allel prosody in the answer. In this pair of questions and answers, the subject NPs in B 
are instances of a contrastive topic, analysed as having a focus embedded in the topic 
(Erteschick-Shir 2007; Krifka & Musan 2012: 30). Focus carries the presence of alterna- 
tives (Krifka & Musan 2012), an element also shared with [contrast]; however, the focus 
may be simply [-contrast]. That is, new information is added to the common ground 
without an overt contrastive reference to other entities in the CG, which has been dis- 
cussed in detail in 84.1. The contrastive topic is represented as a topic with an embedded 
contrastive focus (ContrFoc). For example, (40): (40i) and (40ii) are the representations 
of the contrastive topics of clauses B.i and B.ii, respectively. The topics consist of a set 
of two salient referents in the CG that are commented on by means of their different 
occupations, which are not shown by the representation in (40). 


(39) A: What do your siblings do? 
B: i [[MySISter]goc] rop [studies MEDicine]goc, and 


ii. [[my BROther ]goc ]rop is [working on FREIGHT ship]goc. (Krifka & Musan 
2012: 30) 


(40) i {[my sister ]conuroc. [my brother’ Uran 
ii. (['my sister ], [my brother ]ContrFoc }Top 
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Parallelism by means of coordination, as shown in the English example above, is com- 
mon cross-linguistically. Parallelism that encodes a contrastive set membership is often 
achieved by using the same or synonymous lexical items in structurally marked con- 
structions, such as a left-dislocated position. A contrastive topic expressed in this way 
is found in Sembiran Balinese. This is exemplified by the topic expression iya ba in the 
second clause in example (41a). The partial information structure representation is given 
in (41b). 


(41) a. [Meme]proy ; ngara ajak-a | ng-(g)ellah-ang tanah warisan di Pramboan 
mother NEG invite-3 Av-own-APPL land inheritance in Pramboan 
mapan [iya ba}qop j, [somah-anne senikan ento] j ajak=a. 
since 3SG EMPH wife-3sc.Poss younger that ^ vv.invite-3 


*[I (mother)] was not invited to share the inherited land in Pramboan because 
it’s [she] j [his younger wife] ; whom he invited: 
b. CG: ([' younger wife'] j ContrFoc» [ mother] d Top 


As seen in (41b), the 'co-younger wife’ is the topic, i.e. the salient participant about/to 
whom a land-sharing invitation was discussed/offered. It is a contrastive topic, with the 
contrast achieved by means of the contrasting element of negation associated with the 
same verb ajak. The younger co-wife is referred to by iya, which appears in the left- 
dislocated topic position and whose pragmatic effect is augmented by the use of the 
emphatic particle ba. The full NP somah-anne senikan ento, which appears in the pivot 
position, provides additional specific information about iya. 

Example (42) also illustrates a contrastive topic. The P object ne, 'this' refers to the 
land being discussed. It is topicalised through fronting to the left-most sentence-initial 
position. This way, it gains its contrastive effect; hence, it is a contrastive topic. Note 
that the verb is in the AV form with the subject/pivot being the A argument cahi. The 
A argument is also pragmatically prominent, appearing with the focus marker ba. Both 
of the referents of the A and P arguments are present in the preceding sentences, as 
described in the context description in (42a). The information structure is informally 
represented in (42b). 


(42) a. Context: Bapak, the officer from the Agrarian Office, measured a piece of 
state-owned land to be granted to Butuh Dora. Two salient entities are 
involved in the first clause: the addressee cahi and the land. 

[Ne]top [cahi]a pivFoc ba ngelahang. Ne ngara ukur Bapak ne. 
this 2sG PART AV.have.APPL this NEG Uv.measure father this 


“As for this piece of land, YOU are the one who owns (it). This one, I didn't 
measure it. 


b. CG: ('this land’]contrFoc, [the other land'] 4o, [you ]contrFoc; [the others ], 
---JFoc 


10Parallelism is a prominent feature of the languages of central and eastern Indonesia, particularly in the 
domain of ritual language (Fox 1988; Grimes et al. 1997; Kuipers 1998; Arka 2010; Sumitri & Arka 2016). The 
information structure in a ritual language require further research. 
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The type of structure given in (42) is of particular interest, as both A and P are equally 
contrastive, highly salient and already present in the CG. This reflects the interaction be- 
tween the topic and focus and creates complications regarding the distinction between 
the primary and secondary topics; however, it appears that in a given structure, only 
one is selected as the most prominent topic. This is the left-most unit, ne “this (land), be- 
cause the rest of the predication is about this referent. This topic functions as the frame 
setter, which delimits the interpretation of the other parts of the sentence. In terms of 
CG management, and in line with the definition of topic presented in (29), based on the 
free translation, it is this topicalised P/object that is closer to the "about topichood’ than 
the subject cahi. The object is more prominent than the subject as far as the information 
structure is concerned; however, grammatically, there is good cross-linguistic evidence 
(Keenan & Comrie 1977; Bresnan 2001; among others) as well as language specific evi- 
dence, e.g. from reflexive binding in (Sembiran) Balinese (Arka 2003; Sedeng 2007) that 
the object is less prominent than the subject/pivot. 


6 Frame setting and left-periphery positions 


6.1 Frame setting and topicalisation 


Frame setting, which is exemplified in English in (43), is part of the so-called delimita- 
tion in information structure (Krifka & Musan 2012). The frame setter healthwise/as for 
his health in this example restricts the predication: the new/gap focus FINE must be 
understood within the frame of '(his) health’. Note that the topic here is John, as the 
predication of being ‘fine’ is about him. 


(43) Q: How is John? 
A: {Healthwise/As for his health}, he is FINE. (Krifka & Musan 2012: 31) 


The frame setter carries the presence of alternatives within a particular specific CG do- 
main set out, or assumed, by the speaker. This specificity property of the CG overlaps 
with definiteness characterising the topic, captured by the [+given] property in (14). The 
frame setter therefore resembles a contrastive topic, e.g. the frame setters health wise/as 
for his health means ‘in terms of/talking about his health instead of his other situations’; 
‘his health’ is the specific CG domain within which ‘fine’ must be understood. However, 
a frame setter is not exactly the same as a contrastive topic as it might carry only some 
degree of domain specificity, not the really strong properties of definiteness and saliency 
captured by [+given] and [+salient] features exhibited in (14). We argue that the frame 
setter should be characterised as [+salient, +given, +contrast], where +given captures 
the idea of specificity and a low degree of givenness; this is further discussed in §6.3. 

In Sembiran Balinese, like in English, the frame setter occupies a clause-external left 
periphery position. This is a position left-adjoined to the maximal sentence structure of 
CP in terms of a version of the X-bar syntax in LFG adopted here; see §2, also (Arka 
2003). Sentence (44a) is an example of frame setting from Sembiran Balinese. The phrase 
structure of this sentence is given in (44b). 
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(44) a. [En buat ngamah]pyame, [icang ba]contrroc mehang iya nasi. 


if about Av.eat 1SG PART AV.give 3sG rice 
“As for eating needs, I am the person who gave him meals: 
b. 
CP 
PP CP 
(FrSetter) P dcm 
en buat ngamah, NP IP 


icang ba ` mehang iya nasi 


The predication of 'giving him rice' in (44) must be interpreted in the context of the frame 
setter of 'eating needs' instead of other needs. There is no co-referential or argument- 
dependency relation between the frame setter en buat ngamah and any element in the 
predication. The element nasi ‘rice’ is related to the frame setter in a sense through its 
semantic field, e.g. ‘food-related’ in this case. 

However, there are cases where the frame setter expression can be understood as the 
syntactic dependent of the predicate. These are cases that are traditionally known as left- 
dislocation and topicalisation, exemplified in English in (45) and (46), respectively (Foley 
2007). 


(45) a. Turtles, they make the greatest pets. (Left dislocation) 


b. Mary, I went to university with her. 


(46) a. That dish, I haven't tried. (Topicalisation) 
b. For Egbert, I would do anything. 


Left-dislocation and topicalisation are similar but different types of constructions. In 
left-dislocation, the frame setter and a syntactic dependent in the predication are related 
by means of a pronominal copy. In topicalisation, they are related through a filler-gap 
relation. In languages such as English, left-dislocation is only available for a pivot/subject. 
A topicalisation of a subject is ungrammatical, as it would leave the subject position 
unoccupied, e.g. * Turtles, — make the greatest pets (Foley 2007). 

In terms of information structure, these topicalised/left-dislocated units are con- 
trastive topics, as they carry the presence of contrasting alternatives, e.g. ‘turtles in con- 
trast to other animals as pets' in (45a). 

Example (47) illustrates left-dislocation in Sembiran Balinese. The left-dislocated topic 
(index i) is anaphorically referenced by the pronoun iya in the object position. 


(47) [Beli Dora] ;, nang tua jua ng-adep-in iya ;nyuh pluk kutus puhun. 
Beli Dora father old PART Av-sell-APPL3sG coconut eighteen trees 


‘As for Brother Dora, it was uncle who sold eighteen coconut trees to him 
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There is no clear difference between left-dislocation and topicalisation in (Sembiran) Ba- 
linese. The pronominal copy involved in left-dislocation can be dropped. Sentence (47) 
is still acceptable when iya is elided, making left-dislocation and topicalisation indis- 
tinguishable in Balinese. In addition, the overt third-person pronoun iya in (Sembiran) 
Balinese only refers to animate beings, typically human referents. A definite non-human 
inanimate referent is expressed by a zero pronoun in Balinese. Thus, when the fronted 
topic NP is associated with an inanimate entity, the structure never appears with an overt 
pronominal copy iya. This is exemplified in (48), where the fronted topic nyuh nanange 
nto ‘father’s coconut tree’ is in the left periphery position. It is the P argument of the 
verb nebus ‘av.redeem’. In its object argument position, the P argument has no overt 
realisation, indicated by a Ø. Note that it cannot be overtly realised by iya, indicated by 


(iya). 


(48) [Nyuh nanang-é ento]rop nagih beli Mudiasir buwin nebus Ø/ 
coconut father-pzr that Av intend brother Mudiasir again Av.redeem 
(“iya), nagih bayah=a ny-(c)icil Ø, ngara behang-a kén Man Jantuk. 
3sG intend pay-3 ` Av-pay NEG give-pass by Man Jantuk 
“As for Father's coconut tree, (brother) Mudiasi wanted to redeem (it); (he) 
wanted to pay it in instalments, but it was not accepted by Man Jantuk’ 


The fact associated with the definite inanimate referent, such as in (48) and other cases 
with optional iya for a human referent, shows that it is unclear whether the unexpressed 
argument is a gap (i.e. topicalisation) or a zero pronoun (i.e. left-dislocation). For these 
reasons, the term topicalisation was used for fronted topic NPs in the left peripheral 
position for both cases with or without an overt pronominal copy; however, if necessary, 
the empty position was represented by Ø to make the original position of the fronted 
NP explicit. 


6.2 Ordering of marked topic and focus 


When both the topic and focus are fronted to left periphery positions, there are immedi- 
ate questions. First, what is the constraint, if any, in terms of their order? Second, what 
does the constraint mean in terms of information structure and the broader grammatical 
system? The empirical issue in relation to the first question is addressed in this section. 
The second issue is briefly discussed in the conclusion in 87. 

When two marked discourse functions are in the left periphery positions, the order 
is the frame setter/contrastive topic first, followed by contrastive focus, as informally 
formulated in (49). This is illustrated in (44). Another example is given in (50); however, 
when the focus is a question word (QW), the fronted focus can precede the frame set- 
ter/contrastive topic (see (54)). 


(49) [XP]Frame/ ConstrTopic [XP]constroc [IP]core clause 
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(50) [(En buat) behas]p.contrrop; [iya]A Piv.ContrFoc lakar meli-ang Ø; (en buat) 


(If about)rice 3sG FUT AV.buy-APPL if about 
céléng]p.ContrTop: [cahi] A piv.ContrFoc ''gurus-ang Ø. 
pig 2SG AV.arrange-APPL 


‘As for rice, he'll buy it; as for pigs, you have to arrange them: 


In example (50), there are parallel clausal structures with their P arguments behas, ‘rice’ 
and celeng, ‘pig’ functioning as frame setters, which are also contrastive topics. Note that 
the structure is in the AV voice with the A being a pivot. The unmarked position of the 
object P is postverbal, indicated by @. 

The evidence that these sentence-initial expressions in (50) are topics is that they can 
be marked by the topic phrasal marker en buat ‘as for’. They are also frame setters, as 
they delimit the interpretation of the predication. That is, the action of buying is about/ 
in relation to rice, whereas the other arrangement is in relation to pigs. 

Another important point to note from example (50) is that the topic expressions are 
not definite. They are indefinite/generic, referring to a class of entities called behas ‘rice’ 
versus another class called celeng ‘pig’. No particular rice or pig is referred to: any rice 
or pig would do. In short, this provides evidence from Sembiran Balinese that the topic 
is not necessarily definite. 

There is evidence that the actor pivot arguments, iya and cahi, in (50) are a contrastive 
focus because these are units that can be marked by the contrastive focus relativiser (a)ne 
- an exclusive property of a pivot. The fact that only a pivot can be relativised is a well- 
known characteristic of Austronesian languages. Thus, the pivot iya can receive ane, 
marking the contrast (51a). In contrast, marking the topicalised P results in an ungram- 
matical structure, as shown in (51b). 


(51) a. [Behas]p contrTop: [iya] A Piv.ContrFoc ane lakar meli-ang. 
rice 3sG RELFUT Av.buy-APPL 
‘As for rice, he is the one who will buy it’ 
b. * [Behas]p ContrTop ane, [iya]A Piv.ContrFoc lakar meli-ang. NOT FOR ‘As for 
rice REL 3sG FUT Av.buy-APPL 
rice, he will buy it’ or ‘It is rice that he will buy: 


Reversing the order with the contrastive focus first and the frame setter/contrastive topic 
after compromises the acceptability. In contrast to (50), for example, the following is 
unacceptable: 


(52) ?* [iya]A piv.contrFoc [(en buat) behas]|p contrTop lakar meli-ang @; FOR ‘As 
3sG (if about) rice FUT AV.buy-APPL 
for rice, he is the one who will buy it’ 


Unlike the previous example in (50), the frame setter/topic in example (53a) is definite: 
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(53) a. [Ne]top [cahila pivFoc ba ane ngelahang. 
this 2sG PART REL AV.have.APPL 


“This (land), YOU are the one who owns (it)? 


b. [Ne]top ba — [cahi]a pi, (ba) ane ngelahang. 
this PART 2sG PART REL AV.have.APPL 


"THIS (LAND), YOU are the one who owns (it): 


c. “[Ne]top ba ane [cahi] A piy (ba) ngelahang. 
this PART REL 2SG PART AV.have.APPL 


“NOT FOR “THIS (land), YOU are the one who owns (it)' or 
‘Tt is THIS (LAND) that YOU are the one who owns it. 


Note that the focus appears with a particle ba marking [*contrast]. This particle can 
be associated with either a topic or a focus; hence, (53b) with both a topic and a focus 
marked by ba is possible. The topic in (53b) has a stronger contrast (indicated by placing 
its translation of THIS in capital letters) than its counterpart without ba in (53a), e.g. 
with additional affirmation in response to the addressee's question/hesitation. 

The unacceptability of (53c) provides further evidence that a contrastive focus cannot 
precede a contrastive topic in left periphery positions. In this example, an attempt is 
made to make the first NP a contrastive focus by a ba and ane marking. 

Still, it is possible to have a fronted focus before a topicalised NP in (Sembiran) Bali- 
nese. This is the case when the focus is a question word (QW). This possibility stems from 
the constraint that a fronted QW must be associated with a pivot, an exclusive property 
of the pivot argument in Balinese (Arka 2003). Consider a transitive predicate such as 
alih ‘av.search’ in a declarative sentence, as in (54a). When the A pivot is questioned, the 
QW can appear in situ, as in (54b). The P object can be topicalised, as in (54c). The QW 
can be fronted as well, as in (54d). Note that the fronted QW must be associated with the 
PIV, which is in this case the A argument (index j) because the verb is in the AV. While 
the NP Men Tiwas is closer to the verb than the fronted QW, the fronted OW must be 
associated with the pivot (index j; reading [i]), not the object (reading ii). 


(54) a. Men Sugih ngalih Men Tiwas. 
Men Sugih Av.look.for Men Tiwas 
‘Men Sugih looked for Men Tiwas: 


b. Nyen ngalih Men Tiwas? 
who Av.look.for Men Tiwas 


"Who looked for Men Tiwas?' 


c. [Men Tiwas]ryoy. ; [[nyen]goc [ngalih i Jp? 
Men Tiwas who Av look.for 
“(As for) Men Tiwas, who looked for her?' 
d. [Nyen]foc jj; [Men Tiwas]top Lj ngalih il? 
who Men Tiwas A.PIV AV.look.for P.OBJECT 


i) Who was looking for Men Tiwas?' 
ii) * ‘Whom Men Tiwas was looking for?’ 
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An adjunct can appear as a frame setter. It may also carry an emphatic or contrastive 
meaning. Consider the following example in (55), where the adverb ditu 'there' in the left 
periphery position is referentially the same as the sentence-final adjunct PP 'at Butuh 
Catra's place’. The contrastive-emphatic meaning resulting from the appearance of ditu 
‘there’ in the left periphery position is captured by the rather long free translation given 
the locative adjunct in (55). 


(55) Ba kento [ditu sa]game i [meme]stop bareng megae ` [di Butuh 
after like.that there PART mother together MID.work at Butuh 
Catra-ne ento] ;. 


Catra-DEF that 


"After that, THERE at Butuh Catra's place (i.e. not at other places), I (mother) 
work together’ 


6.3 Scope, contrast, and negation 


In this final subsection, we address the issue of the scope of focus/contrast and re- 
lated complexity due to the interaction of information with negation, topicalisation and 
pragmatic-contextual implication where local socio-cultural information might also be 
important. We begin with the different sizes focus can apply. 

Units of different sizes can be put into contrast, bearing a new focus, from a broad new 
focus covering the whole sentence (even a string of sentences) to a narrow(er) new focus 
involving smaller/lower clausal constituents, such as VPs with their object NPs, just the 
object NP or oblique PP of a VP or possibly even just the modifier part of the object NP. 
A (wide) sentence new focus (cf. Lambrecht 1994) is exemplified by the answer sentence 
in example (56) from English (Foley 1994). The same sentence in different discourse 
contexts would have a different information structure involving different units of new 
focus. If the context of the dialogue in (56) already included Los Angeles (LA) as part of 
the CG information, then LA would not be part of the new focus unit. 


(56) Q: What happened? 
A: An earthquake just hit Los Angeles. 


Of particular interest are the intricacies of the different sizes of unit being focussed and 
contrasted through negation. Negation is of particular interest because it illustrates the 
complexity of a semantic-pragmatic-syntax interface wherein there can be a mismatch 
between scope in semantics and pragmatic information structure. Consider the yes/no 
question-answer pair in (57) and its context. 


"Foley distinguishes between focus and new information. He argues that not all the new information is a 
focus, e.g. a high falling pitch in the answer sentence in (56) would indicate that the last NP Los Angeles is 
the focus (i.e. the earthquake happened just in Los Angeles not in other cities), while the whole sentence 
carries new information. Focus in Foley's sense here is equivalent to the emphatic/contrastive focus type 
captured by the [contrast] feature. 
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(57) Context: Men Dora told a story about how she gave her money to somebody but 
did not get her land certificate, and consequently, she lost her land. 


Ouestion: Bakat tanah-e? (=(15)) 
uv.obtain land-DEF 


"Did you get the land?' 
Men Dora: [ngara]goc, [ngara]goc bakat. 
NEG NEG UV.obtain 


‘No, (I) didn't get (it): 


Semantically, the negation is wide in scope, as it negates the whole sentence/proposition; 
however, in terms of information structure, the new information (i.e. new focus) does 
not cover the whole sentence/proposition. The new focus here is the negative choice 
itself. A yes/no question offers closed alternatives, and in this instance, the “no” option 
is chosen/true. The information conveyed by the other parts of the sentence is not new. 
The subject topic ‘the land’ and the A argument (‘you’) are already understood (i.e. part 
of the CG) and are therefore elided. The predication encoded by the UV verb 'obtain' is 
also presupposed information. 

A negator can often result in different scopes, possibly with ambiguity, typically when 
the negation is of the type of normal sentential negation, as exemplified in (58a). Note 
that, even though the negator ngara appears in its preverbal position in this sentence, 
reading (ii) is possible: it does not negate the predicate mati, but only the adjunct ulihan 
nyai following the verb. This reflects the narrow scope of the negation. 


(58) a. Kucit-e ngara mati ulihan nyai. 
piglet-pEF NEG die because of joer 


i. ‘The piglet was not dead because of you. 

ii. "The piglet was dead not because of you’ 
b. Kucit-e mati ngara ulihan nyai. 

piglet-DEF NEG die because.of 2sG.F 

* i. “The piglet was not dead because of you’ 


ii. "The piglet was dead not because of you’ 


Sentence (58b) exemplifies a constituent negation where the negator immediately pre- 
cedes the PP. The predicate matiis not in the negation scope; hence there is no ambiguity. 

Then, the negated unit together with its negator can appear in the left-periphery posi- 
tion to express a contrastive focus, as illustrated by the excerpt from the corpus in (59). 
In this example, the context provides the contrasting reason for the piglet's death. How- 
ever, it should be noted that, even without an explicit contrasting element as in (58b), 
the constituent negation presupposes that something else has caused the death of the 
piglet; hence the negated adjunct is contrastive focus. 


168 


5 Information structure in Sembiran Balinese 


(59) Ento kucit mati kinnya behan sakiten; [ngara [ulihan nyai]]goc kada mati. 
that piglet dead because by sickness NEG because.of 2sc.F make dead 


‘The piglet died because of its sickness; not because of you, (the thing that) 
caused its death’ 


Fronting may give rise to topicalisation, selecting a narrow scope for negation, and the 
fronted unit appears to behave like a topic. Its status as contrastive topic (or focus) is, 
however, not immediately clear. 

Consider (60), where the negator ngara appears in its position preverbally, but its 
semantic scope is narrow, due to the fronting for the object. That is, the predicate ngelah, 
‘have’ is not within its scope of negation. Based on the context, it is understood that 
the speaker might have other types of produce, but she had no coconuts—the relevant 
produce needed for the barter. 


(60) Budi luwas ke gunung ngalih sela, [nyuh]contrFoc ngara ngelah. 
want go to mountain AV.get sweet.potato coconuts NEG Av.have 


‘I was going to go to the mountain to trade for sweet potatoes, but, as for 
COCONUTS, I didn’t have any’ 


The fronted object P nyuh to the left-dislocated position is assigned [+contrast]. The 
question is whether the fronted unit is a contrastive focus, as seen in the fronted adjunct 
in the preceding example in (59), or a contrastive topic. 

We contend that it is neither; it is a frame setter. It is not really the focus, as the focus 
is in fact the narrow negation with respect to possession of ‘coconuts’. It is not really 
the topic, as it is not definite, and it is actually new as far as the immediate CG context 
is concerned; hence [-given]. 

The referent of ‘coconut’, however, could be thought of as [+salient], as evidenced by 
the fronting (a common strategy for expressing some kind of salience). It is also salient 
as far as the local socio-cultural context is concerned. That is, the fronting should also be 
understood as the speaker’s intention to express not only the bipolar sense of contrast 
in relation to the negation of coconuts, but also in relation to the socio-cultural salience 
of coconuts vs. other items of farm produce for bartering. The two senses of contrast are 
represented in (61), with (b) showing the sociocultural-economic contrast set. 


(61) a. Truth value contrast: 
{[have no coconuts’ ]Negfocus: [having coconuts’ ]}frame 


b. Local cultural-economic contrast: 
{[ no coconuts for barter’ ]yegfoc [other items for barter]}frame 


The data of the types shown above raise an important issue in the analysis of (contrastive) 
topic and frame setter in terms of the feature space outlined in (14). The challenge is how 
to capture the different degrees of specificity and salience characterising the CG. The 
CG may simply be specific in the larger context of a particular domain due to our under- 
standing of the world or due to certain local socio-cultural knowledge. Such referential 
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specificity may not render the referent of an entity definite (i.e. [+given]) but the referent 
is not totally [-given] either. We propose that such a referent bears a weak given property, 
represented by [+given] in our proposed feature space; meaning it is ‘specific’, neither 
indefinite nor definite; see the discussion of the relationship between (in)definiteness 
and specificity (Enc 1991). This issue highlights the gradient nature of the information 
structure categories. Thus, the other features [contrast] and [salient] can also be thought 
of as gradient in nature, represented in the same way, [+contrast] and [+salient] respec- 
tively. Exploring the precise implication of adding this weak dimension as another value 
to the analysis of information structure is beyond the scope of the present paper. 


7 Conclusion and final remarks 


New data on information structure from Sembiran Balinese, an endangered conservative 
mountain dialect of Balinese, has been presented. This is the first detailed study on infor- 
mation structure in this language that outlines the ways the pragmatic functions of the 
topic and focus interact with each other and with the grammatical functions in the gram- 
mar of this language. In a descriptive-empirical context, it has been shown that Sembi- 
ran Balinese employs combined strategies, exploiting the available structural positions 
(e.g. left/right periphery, parallel clausal structures) and morpho-lexical and syntactic 
resources in grammar (e.g. voice systems and particles) and general local knowledge. 
Prosody has been identified to play a role in Sembiran Balinese, but its precise role in 
information structure in this language requires further research. 

In an analytical context, the novel approach of the analysis presented is the con- 
ceptions of the topic and focus as complex notions, decomposed into three semantic- 
discourse/pragmatic features of [+/-salient], [+/-given] and [+/ —contrast]. Based on 
these features, the information structure space in Sembiran Balinese was explored. The 
investigation revealed that the three features of topic and focus interact in complex ways, 
allowing for different possibilities to characterise different subtypes of the topic and fo- 
cus in Sembiran Balinese, such as default/primary topic, secondary topic, contrastive 
topic/focus and new topic/new focus. Throughout the paper, language-specific charac- 
terisations and supporting data for these sub-types of topic and focus in Sembiran Bali- 
nese have been provided. Thus, this study has contributed to the typology of information 
structure and the framework by which such a typological study can be conducted. 

On a theoretical level, the analysis assumes a modular parallel model of grammar, as 
in LFG (Bresnan et al. 2015, among others, Dalrymple 2001) and RRG (Van Valin 2005, 
Van Valin & LaPolla 1997). There have been proposals regarding how i-str units can be 
formally and precisely mapped onto other layers of structures in grammar (King 1997, 
Mycock 2013, Butt 2014). The comprehensive classification of the discourse functions of 
the topic and focus provided in this paper can be formally utilised in existing frame- 
works. 

One theoretical point worth highlighting is the concept of prominence in linguistic 
theories, and to certain extent, in language typology. Prominence in LFG, for example, 
has played a key role in the linking/mapping theory to account for cross-linguistic pre- 
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dictability and variations in semantics-syntax interfaces. The basic principle of any the- 
ory of linking is harmonious prominence matching: most prominent items across lay- 
ers tend to require being mapped onto each other. Thus, given the widely agreed cross- 
linguistic generalisation of the prominence of A>P in semantic-argument structure and 
the pivot/subject>object in syntactic argument structure, there is a cross-linguistic ten- 
dency of A and the Subject to be mapped to each other. In this context, the prominence of 
the i-str space is included based on the proposed conception, as discussed in $3. The de- 
composition of the topic and focus into features with values allows for representing the 
gradient nature of the types of topic and focus thus far identified. Based on the analysis, 
the presence of the properties (i.e. with + value) contributes to the prominence, which 
results in the gradience shown in (62). 

Some discussion is needed for the gradience of information structure prominence cap- 
tured by (62). First, the gradience comes with two opposing ends in which the contrastive 
topic/frame setter is the most prominent category (with all features having plusses) and 
new/completive focus is the least prominent (with all features having minuses). The plus 
value should be understood as the presence of the relevant information structure prop- 
erty, possibly with its overt marking. Thus, from the speaker's perspective, a contrastive 
topic/frame setter encodes an information unit singled out as having some kind of impor- 
tance, which has been amply demonstrated in the previous discussion as having salient 
or marked structural and prosodic properties; e.g. fronted, stressed and/or marked by 
particles; see $5.5 and $6.1. Structurally, it is high in the phrase structure tree; see (5). In 
contrast, the new/completive focus is the least prominent, as evidenced from its struc- 
tural and prosodic properties in Sembiran Balinese; e.g. expressed later in the clause 
(formally low in the phrase structure tree) and linked to a less prominent grammatical 
function (e.g. P as object, rather than the pivot). 

Second, the information structure prominence captured in (62) should be understood 
as reflecting the general pattern, with typical structural and prosodic correlates as just 
mentioned. However, there is no one-to-one correlation, and there may be a case where 
focus structurally comes first and the frame setter/contrastive topic comes second. This 
is when the focus is linked to the pivot, which is the most prominent grammatical func- 
tion, as seen in the case with a fronted QW, exemplified in (54). This indicates that gram- 
matical prominence outweighs information structure prominence, at least in (Sembiran) 
Balinese. 


(62) 


1 2 3 4 FEE EO EE 
salience + + + - - -E 
given 2ub _ 4 . 
contrast + + — + EECH 
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1-a.ContrTopic; b.FrameSetter 5=NewTopic 


2-ContrFocus (fronted) 6-SecondTopic 
3-Default Topic 7-ContrNewFoc 
4-ContrSecondTopic 8-New/Completive Focus 


Third, the prominence is gradient in the sense that there are no discrete or clear bound- 
aries in the ranking of the categories in between the two ends, even though a pattern 
indicating the ranking of two subclasses is observable in (62): subgroups 2-4 vs. 5-7. Sub- 
group 2-4 appears to be more prominent than subgroup 5-7. Evidence for this in Sem- 
biran Balinese, for example, comes from the positive values of the information structure 
features correlating with the structural marking properties. Thus, the default topic comes 
structurally higher, before the verb, whereas the secondary topic comes later, after the 
verb. 

However, the prominence of member categories within each group is not always clear. 
For this reason, no boundaries are represented separating them within their own group 
(62). The labelling and ordering of the members of the second group, 5-7, are for conve- 
nience only. More research is needed to determine how new topic (5), secondary topic 
(6) and non-fronted (contrastive) new focus should be ranked with one another. 

Finally, another question regarding prominence is whether the three features are also 
ranked against each other. While a definite answer to this question requires further ver- 
ification and investigation (as the element of contrast can be achieved by means of more 
than one strategy), it appears that [+contrast] outweighs [+givenness]. Evidence for this 
can be found in the contrastive new focus in (Sembiran) Balinese, which triggers the 
linking to the pivot, as seen in example (23). That is, when the A is [+given] and P is 
[+contrast], even though it is new, it triggers the linking to pivot and can claim a posi- 
tion earlier or higher (i.e. more prominent) (phrase-) structurally. 

Prominence is a broad and important concept in typological and theoretical linguis- 
tics, and this paper has contributed to the empirical basis of this area of research. The 
notion of contextual CG is central in the information structure analysis, and this paper 
has also contributed to the empirical basis in this discourse pragmatic research by high- 
lighting the significance of the local socio-cultural information in understanding infor- 
mation structure in Sembiran Balinese. Languages vary in terms of coding resources, and 
this paper has contributed to descriptive linguistics, showing how linear order and con- 
stituency, voice system, and other grammatical-lexical resources interact to convey com- 
plex and subtle communicative meanings. More research is required to uncover whether 
similar patterns and complexities are encountered in the neighbouring languages, and 


beyond. 
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Abbreviations 
1, 2,3 first, second and NEG negator 
third person pronouns NML nominaliser 
A actor NOM nominative 
ACC accusative P most patient-like argument 
APPL applicative of transitive verbs 
ART article PART particle 
AV actor voice PERF perfect 
CAUS causative Piv Pivot 
CG Common Ground POSS possessive 
DEF definite RED reduplication 
Foc Focus REL relativiser 
FUT future S subject of intransitive verb 
h.r. high register SG singular 
LIG ligature Top Topic 
MID middle voice UV undergoer voice 
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Chapter 6 


Constituent order and information 
structure in Indonesian discourse 


Dwi Noverini Djenar 
The University of Sydney 


This study draws on theories of information structure to examine the findings from Cum- 
ming's (1991) study showing changing preferences in constituent order in Indonesian. Cum- 
ming found that predicate-initial clauses, an important grammatical resource for encoding 
events in Classical Malay, are diminishing in Indonesian, and agentive clauses are now pre- 
ferred. Based on data from fictional discourse and television reports, three clause structures 
are examined: [meN-V], [di-V-nya] and [ia V]. I show that the use of [meN-V] to denote 
successive events and to mark the climactic portion ofa story episode illustrate most dramat- 
ically the preference for agentive clauses in Indonesian. I also argue that [di-V-nya] remains 
an important resource for encoding events but its pragmatic function seems to have weak- 
ened. This clause type is now mainly used to mark a new focus, drawing the addressee's 
attention to a particular event or series of events. Meanwhile, objective voice [ia V] is also 
used to encode events; however, unlike [di-V-nya] which is structurally different from [meN- 
V], the use of [ia V] alongside agentive clause structure [ia meN-V] creates an impression of 
structural symmetry and can serve two goals simultaneously: marking event and signalling 
a new focus. It could be that, as the pragmatic force of [di-V-nya] is weakening, [ia V] is 
increasingly preferred for marking focal events. 


1 Introduction 


In her work on constituent order in Malay, Cumming (1991) makes an important ob- 
servation regarding syntactic change in this language, arguing that the predominant 
VSO order in Classical Malay has gradually given way to SVO order in Indonesian and 
Malaysian.! She points out that predicate-initial clauses, an important grammatical re- 
source for encoding "eventiveness" in Classical Malay, are diminishing in Indonesian, 
and agentive clauses are now preferred (Cumming 1991: 199). Eventiveness, in her term, 
refers to "a related class of phenomena having to do more with characteristics of an event 


1Cumming uses the term “Modern Indonesian" to refer to contemporary Standard Indonesian, and ‘Malay’ 
to the language of which Indonesian and Malaysian national languages are varieties (1991: 2). 
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than with characteristics of its participants"; while some phenomena "are primarily re- 
lated to the inherent semantics of the event”, others “have more to do with the way the 
event is presented" (Cumming 1991: 123). Eventiveness is also associated with high dis- 
course transitivity and event sequentiality (Cumming 1991: 161-162). Cumming (1991: 176) 
notes that in Indonesian, predicate initial clauses with passive morphology such as [di- 
V-nya] are still used to encode eventiveness but eventiveness is not the only motivation 
for this passive morphology. Such clauses tend to be "especially eventive" (Cumming 
1991: 176). Clauses with pre-predicate patient indicate that the patient is treated as "top- 
ical", that is, it has either been mentioned in the immediately preceding clause or a few 
clauses back and needs recalling (Cumming 1991: 176).? 

In this study, I examine Cumming’s findings in light of more recent data from standard 
Indonesian.’ My goal is to demonstrate that, although the data support Cumming's ob- 
servation, the preference in Indonesian is for events to be encoded with [meN-V] clauses, 
and that predicate initial clauses such as those encoded with [di-V-nya] are marked, there 
seems to be a further development since her study. I focus my discussion on the interac- 
tion between three types of clause structure - agentive clauses with [meN-V], [di-V-nya] 
‘passive’ clauses, and objective clauses with [ia V] - to show the following. First, [meN- 
V] clauses are indeed preferred for encoding various events, including successive events, 
irrespective of whether they are performed by the same agent or different agents. Cum- 
ming (1991: 175) states that in Indonesian, [di-V-nya] passive morphology is reserved 
for “especially eventive" clauses that mark the “climactic portion" of a story episode. 
The data in this study show that [meN-V] clauses are also used for this purpose. These 
provide strong support for Cumming's finding that the preference in Indonesian is for 
agentive clauses. 

Second, the function of [di-V-nya] appears to be weaker now than Cumming found in 
her data. Events encoded by [di-V-nya] in the present data do not need to be "especially 
eventive" or to occur in the climactic portion of a story episode; rather, [di-V-nya] is 
mainly used to signal a new focus. A switch from [meN-V] to [di-V-nya] invites the 
addressee/reader to turn their attention to a different event or series of events. The switch 
may indicate a shift from a series of descriptive clauses to an eventive one, or from a 
series of eventive clauses to a different event or series of events. 

Third, in addition to [di-V-nya], another type of clause is also employed to mark a 
new focus, namely objective clause with free third person pronoun ia ‘3sc’. The objective 
clause structure [ia V] is often preceded and followed by [ia meN-V] agentive clauses. A 
switch from [ia meN-V] to [ia V] signals a new focus, similar to a switch from [meN-V] 
to [di-V-nya], but whereas [di-V-nya] marks a structural difference from [meN-V], the 


?*Topical" in Cumming’s (1991) sense is slightly different from “topic” in theories of information structure 
adopted here. Cumming's use of the term considers the tracking of a referent across clauses and narrative 
episodes, and the position of the participant relative to other participants in narrative, whereas topic in the 
information structure theories is defined at clause or sentence level. Nevertheless, in both cases, a topic 
referent is understood as one that has been previously mentioned and hence can be presupposed. 

3Cole et al. (2008) distinguish between standard Indonesian that is the formal variety of the language and 
the “prescriptive standard" variety, i.e., the highly institutionally prescribed variety. No such distinction is 
made in this study. 
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use of [ia V] objective clause alongside [ia meN-V] agentive clause creates an impression 
of structural sameness. In both cases, the third person pronoun ia '3sc' is preverbal, thus 
highlighting the continuity of the third person as agent (or actor in intransitive clause) 
through explicit mention of the agent. In events encoded by [meN-V] clauses, agent 
continuity tends to be indicated with zero. 

I draw on theories of information structure advanced by Lambrecht (1994), Gundel 
(1999), Gundel et al. (1993) and Gundel & Fretheim (2004) to demonstrate how a switch 
from [meN-V] clauses to [di-V-nya] or [ia V] can be explained as a shift to a new focus. 
As will be shown, a switch to a different grammatical construction functions to draw the 
addressee's attention to a particular action or series of actions the agent is performing 
which cannot presupposed from the previous clauses. Drawing on the notion of “topic” 
in these theories is also useful for explaining why one participant, and not another, is 
referred to with zero in a series of [meN-V] clauses as well as what the relation is between 
the topic participant and the focal elements in the clause. More generally, examining the 
functions of the different clause structures in terms of information structure is helpful 
for explaining why clauses occur in the structures that they do. 

Most of the data for this study are taken from six Indonesian novels. One of the novels 
was published in 1977, and the others between 2002 and 2012. I chose fiction rather than 
other types of text based on the following considerations. First, fictional texts contain 
numerous instances of clause chains and different clause structures, including [di-V-nya] 
clauses which may be rare in other text types, thus enabling us to examine the interaction 
between the different clause structures. Second, Cumming's findings on Indonesian were 
based on fictional texts, hence using data from the same text type but a different period 
is helpful for showing in what respects her findings are current and how they can be 
extended. I include examples from television news broadcast to show that the arguments 
I offer are also relevant for non-fictional texts. 

This chapter is structured as follows. $2 provides a description of the different clause 
structures to be analysed and introduces Cumming’s (1991) “trigger” system. $3 provides 
an overview of "topic" and “focus”, key concepts in information structure, to facilitate 
the analysis of the three Indonesian clause types in $4, $5, and 86 respectively. The study 
concludes in 87 with a summary of findings and suggestions for further research. 


2 Agent trigger and patient trigger clauses 


In her study, Cumming (1991: 29) discusses different clause structures in terms of “trig- 
ger”. She distinguishes three types of clause: the intransitive clause, the “agent trigger" 
(AT) clause and "patient trigger" (PT) clause, the latter two being types of transitive 
clauses. "Trigger" is a syntactic role label for "the participant which is a) obligatory, and 
b) functions as the shared argument or ‘pivot’ in clause combining" (1991: 31), and whose 
semantic role "triggers" the verb morphology. In the AT clause, the trigger has the se- 
mantic role of agent, while in the PT clause, it has the semantic role of patient. I adopt 
her terminology here. 
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In Indonesian AT clauses, the agent typically precedes the verb and the verb is gener- 
ally prefixed with meN-, as shown in (1). 


(1) Mel meng-ambil kimono. 
Mel MEN-fetch kimono 


"Mel fetched (her) kimono? (Nuranindya 2009: 14) 


PT clauses are of two types. The first has [di-V] verbal morphology. In this clause type, 
the agent, when expressed, occurs in an adjunct phrase typically marked by oleh ‘by’, as 
shown in (2). This type of PT clause is also referred to in the literature as "passive" or 
"passive type 1" (Sneddon et al. 2010). 


(2) Sepasang suami istri yang sedang pergi ke tahlilan keluarga di Cirebon 
one couple husband wife REL in.processgo to tahlilah family Loc Cirebon 
di-serang oleh geng motor. 
prattack by gang motor 
‘A husband and wife, who went to a family tahlilan event in Cirebon, were 
attacked by a motorcycle gang "7 (Fokus Pagi, Indosiar, 9/11/2015) 


A related clause type is one where [di-V] is followed by the cliticised third person agent 
-nya '3sc'*. The enclitic -nya is attached to the verb rather than to the preposition oleh 
‘by’ in an adjunct phrase (olehnya ‘by her/him/it’). In example (3), [di-V-nya] occurs with 
pre-predicate patient. 


(3) Dan setelah pamili ini agak berada sedikit, kamar itu di-pakai-nya sebagai 
and after family this rather wealthy a.little room that Di-use-3sG as 
kandang anjing yang bor `  di-beli-nya. Lit. ‘And after the family became a 
kennel dog REL recently pr-buy-3sG 
little wealthy, the room was used by him/her as a kennel for the dog they had 


bought: 

"After the family became a little wealthy, they used the room as a kennel for the 
dog s/he had just bought: (Idrus, cited in Kaswanti Purwo 1988: 204; my 
translation) 


[di-V-nya] clauses also occur with post-predicate patient, as in (4). 


(4) Babuitu me-lihat air yang berkilauan. [...]. Kemudian di-ambil-nya abu, 
maid that MEN-see water REL BER-shine then DI-get-3SG ash 
di-siram-kan pada air kencing dan [...]. 

DI-pour-APPLLOC water urine and 

"Ihe maid looked at the shining liquid. [...]. Then she took some ashes and 
sprinkled (them) on the urine and [...]? (Pramoedya Ananta Toer 1963: 25-26; 
cited in Kaswanti Purwo 1988: 219; translation from Kaswanti Purwo 1988: 219)? 


^Tahlilan is a Muslim social gathering to commemorate the dead. 
5For consistency, I use my own glossing in this example rather than the original. 
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As Cumming's (1991) and Hopper's (1979) studies show, [di-V-nya] is a common device 
for marking foregrounding in Classical Malay. Foregrounding clauses, in Hopper's defi- 
nition, are clauses denoting "events belonging to the skeletal structure of the discourse" 
(Hopper 1979: 213). These are clauses that move a story along temporally, in contrast to 
backgrounding material that do not. Cumming (1991: 123) uses the term "eventiveness" 
rather than "foregrounding" to place less emphasis on the participants and more on the 
characteristics of the event. The frequent use of this structure in Malay texts has led to its 
conventionalisation as a structure denoting an action performed by some third person 
agent. Kaswanti Purwo (1988: 205) suggests that [di-V-nya] is "the type of di- used to 
describe a series of chronologically ordered punctiliar actions by a single actor". This is 
certainly true of example (5) below. 


(5) Di-cuci-nya muka-nya, kemudian di-lap-nya kering-kering, lalu kembali ke 
pI-wash-3sc face-3sG then DI-wipe-3sc dryly thenreturn to 
kamar-nya. 
room-3sG 


‘She; washed her face;, she; wiped [ ]; dry, then [ ]; returned to her room: 
(Krisna 1977: 119) 


However, [di-V-nya] is not the only construction that can be used to encode successive 
actions by a single actor, as will be shown later in this chapter. In Indonesian, [di-V- 
nya] is found mainly in written discourse, particularly in older fictional narrative (the 
example in (5) is taken from a novel published in 1977). As we saw in (4), [di-V] is also 
used to encode such actions. This example is from a collection of short stories published 
in 1963 by acclaimed writer Pramoedya Ananta Toer. In more contemporary texts, this 
is more likely to be rendered in either in [meN-] AT clauses or a combination of AT and 
PT clauses, as will be discussed in 84, 85 and §6. 

The second type of PT clause contains an unprefixed verb with the agent in pre- 
predicate position, either in the form of a clitic, such as in ku- ‘1sc’ in (6), or a free 
pronoun, such as ia ‘3sc’ in (7).° This type of PT clause is better known in the literature 
as "objective voice" - henceforth OV (Arka & Manning 1998; Cole et al. 2008; Kroeger 
2014). This study is mainly concerned with OV clauses containing the third person pro- 
noun ia. 


(6) Ku-guyur seluruh tubuh-ku dengan air dari keran wudu. 
isG-pour.water entire body-1sc with water from tap wudu 


‘I poured water on my body with water from the wudu tap’® (Kumalasari 2006: 
12) 


The agent may also be proper name or kin term. 
"Beside ia, the third person pronoun dia is also used in Indonesian. The latter is not discussed in this study. 
8 Kran wudu is the tap Muslims use to get water from to wash their face, hands, and feet in preparation for 


prayer. 
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(7) Buku-buku yang semula berserakan di | atas tempat tidur, meja dan lantai 
book-RDP REL previously scattered Loc top place sleep table and floor 
ia masuk-kan?ke dalam kotak kardus. Lit. “The books which were 
3SG putin-APPL DIR inside box cardboard 
previously scattered on the bed, table and floor, he put in cardboard boxes: 
‘He put the books which were previously scattered on the bed, table and floor in 
cardboard boxes: (Kurniawan 2002: 195-196) 


Cumming (1991: 34) uses the following shorthand to refer to the different types of con- 
stituent order: V>T for clauses in which the predicate occurs before the patient, and T-V 
for those the patient precedes the predicate (in her terms, "predicate before trigger" and 
"trigger before predicate" respectively). She categories OV clauses with pre-predicate 
agent such as in (6) as PT clauses and not AT clauses even though the agent is in pre- 
predicate position (and the order of the constituents is therefore A V P). Her argument 
for doing so is that in this type of construction, the preverbal agent occupies the same 
slot as the prefix di- in passive. Nevertheless, as I argue later in this study, the choice of 
the free pronoun ia instead of the enclitic —nya is pragmatically and stylistically mean- 
ingful. 

The foregoing examples show that events can be encoded in different ways, such with 
[meN-V] AT clause, [di-V] or [di-V-nya] PT clauses with either pre-predicate or post- 
predicate patient and OV clause with proclitic or a free pronoun as agent. Given these 
different possibilities, what motivates the use of PT clauses and what is the functional 
difference between PT clauses with T>V order and that with V>T order? As Cumming 
argues, PT clauses with post-predicate patient (V>T order) are an important grammatical 
resource for encoding eventiveness in Classical Malay while pre-predicate patient (T>V 
order) in PT clauses indicates the topical status of the patient. I argue that in contexts 
where both AT and PT clauses are used, a switch from AT clauses to a PT clause marks a 
shift in focus. By switching to a PT clause, the writer signals that there is new information 
and the reader is invited to direct their attention to it. In the next section, the notions of 
"topic" and "focus" are explained. 


3 Topic and focus 


"Topic" and "focus" in the sense of Lambrecht (1994), Gundel (1977), Gundel (1999), Gun- 
del & Fretheim (2004), and Gundel et al. (1993) are relational pragmatic categories. Topic 
is defined in terms of aboutness: "the thing which the proposition expressed by the sen- 
tence is ABOUT" (Lambrecht 1994: 118; emphasis in original). The term "topic" designates 
the "entity" or "the discourse referent" about which the proposition is construed (Lam- 
brecht 1994: 127). A topic must be referential, individuated (Gundel & Fretheim 2004: 
187), and cognitively active, i.e., it is mentioned in the sentence or clause, and not merely 
inferred. In the example below (from Lambrecht 1994: 127; capitalisation in original), the 


?Indonesian has two applicative suffixes, -kan and - i. In this study I treat [di-V-kan] and [di-V-i] as a subset 
of [di-V]. 
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expressions ‘Pat’, ‘she’, and ‘her’ all refer to the same entity, and that entity is construed 
as being what the propositions are about. The referent of these topic expressions is iden- 
tifiable through the mention of the proper name ‘Pat’ and the third person pronominal 
forms ‘she’ and ‘her’. 


(8) a. Pat said they called her Twice. 
b. Pat said she was called TWICE. 
c. Pat said SHE was called. 
d. Pat said they called HER. 


Van Valin & LaPolla (1997: 204) explain that topics “either name a topic referent in the 
discourse, or they are simply involved in the expression of a semantic relation between 
a topic referent and a predication.” Topics that have the former function are generally 
coded as lexical NPs, while those in the latter function, are “most often” coded as zero 
or unstressed pronouns. In terms of syntactic coding for topic and focus, Van Valin & 
LaPolla (1997: 205) provide a useful scale of acceptability, shown below in Figure 1. 


Markedness of occurrence as focus 


E 
Clitic / Definite Indefinite 
Zero bound pronoun Pronoun Pronoun NP NP 


[-stress] [*stress] 


Markedness of occurrence as topic 


Figure 1: Coding of referents in terms of possible functions (Van Valin & LaPolla 
1997: 205) 


This figure shows that zero coding is the least marked coding for a topic, while indefinite 
NP is the least marked coding for a focus. According to Van Valin & LaPolla (1997: 205), 
“while indefinite NPs can be topics under special contextual circumstances, it is impos- 
sible for a focal element to be zero”. Zero correlates with the referent’s cognitive status 
as active (Givón 1975: 379). 

Focus, like topic, is defined relationally; focus is a relational notion that “determines 
the main predication in the sentence, that predication being assessed relative to topic" 
(Gundel & Fretheim 2004: 190). Lambrecht (1994: 213) defines it as "the semantic com- 
ponent of a pragmatically structured proposition whereby the assertion differs from the 
presupposition". The difference between a pragmatic presupposition and a pragmatic as- 
sertion has to do with the difference between what the hearer is assumed to know at the 
time of hearing the sentence, and what s/he is expected to know as a result of hearing 
it, as spelt out below (Lambrecht 1994: 52; capitalisation in original). 


PRAGMATIC PRESUPPOSITION: the set of propositions lexicogrammatically evoked in a 
sentence which the speaker assumes the hearer already knows or is ready to take 
for granted at the time the sentence is uttered. 
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PRAGMATIC ASSERTION: the proposition expressed by a sentence which the hearer is 
expected to know or take for granted as a result of hearing the sentence uttered. 


The relation between topic and focus is one between “pragmatically non-recoverable to 
the recoverable component of a proposition". Focus creates a new state of information 
in the mind of the addressee (Lambrecht 1994: 218), thus is often conflated with Chafe's 
(1994) notion of “new information" in his discussion on referent accessibility; however, 
as Lambrecht (1994: see chapter3) stresses, the two are not the same. 


(9 Q: When did you move to Switzerland? 


A: When I was SEVENTEEN. 
(Lambrecht 1994: 48, 217) 


The stressed element in the answer is identified as the focus. What makes the proposition 
informative is not the fact that the person was seventeen years of age at some point in 
her life (which is the pragmatic presupposition in this example), but that s/he moved 
to Switzerland at the time she was seventeen (the pragmatic assertion). The assertion 
(focus) thus stands in relation to the topic (the person moving) and the action of the topic 
referent (i.e., moving to Switzerland). Similarly, in (10), the referent of 'Bill' is cognitively 
active (it is mentioned in the question Q). However, Bill is the focus here because in 
the presupposition ‘speaker saw x’ where “x = Bill’, it is “new” information. This is an 
example of what Gundel & Fretheim (2004: 182) call “contrastive focus". 


(10) Q: Did you see John or Bill? 
A: Bill. 
(Van Valin & LaPolla 1997: 205) 


Lambrecht (1994: 221-235) distinguishes between argument-focus, predicate-focus, and 
sentence-focus structures. Argument-focus structure is a sentence in which the focus is 
the missing argument in an open proposition (similar to “narrow focus" in Givón 1975). 
This is illustrated in (11). Here the focal element 'car' is the argument of the proposition. 


(11) Q: I heard your motorcycle broke down? 


A: My car broke down. 
(Lambrecht 1994: 48, 223) 


In predicate-focus structure, the predicate is the focus, and the subject with all its ele- 
ments are in the presupposition. 


(12) Q: What happened to your car? 


A: My car/it broke pown. 
(Lambrecht 1994: 48, 223) 


In sentence-focus structure, it is both the subject and predicate in the sentence (minus 
any topical, non-subject elements) that constitute the focus (e.g., in presentative con- 
structions). 
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(13) Q: What happened? 


A: My car broke down. 
(Lambrecht 1994: 48, 223) 


Alllanguages have some grammatical means for marking topic and focus, including mor- 
phosyntax, prosody, word order, or any combination of these (Van Valin & LaPolla 1997: 
201). With regard to Indonesian PT clauses, Cumming (1991: 175) notes that the position 
of patient relative to the predicate provides an indication of its topical status (or “the- 
matic" in Kaswanti Purwo's 1988: 226 term). Pre-predicate position suggests topicality, 
while post-predicate position indicates the patient is “not made prominent" and “not 
highlighted". As will be shown in $5, PT clauses are marked and they are not always 
used to indicate a climactic point in a story. Agentive [meN-V] clauses can also be used 
to mark a climactic portion, as will be shown in $4. Drawing on the notions of focus and 
topic helps us explain the significance of pre-predicate patient by taking into account 
its relation to the predicate, as well as the significance of the referent of the patient 
argument in the discourse. 


4 MeN- clauses 


One ofthe most important findings in Cumming's (1991) study is that modern Indonesian 
strongly shows a preference for AT clauses. A recent study by Shiohara (2015) supports 
this finding. Using short videos showing a young person performing a series of actions 
(e.g., breaking an egg into a bowl, whisking the egg, adding some milk, then whisking it 
again) for data elicitation, Shiohara's study shows that of all the transitive clauses found, 
88.5% were AT, while the remaining clauses were rendered in PT (8.6% in di- passive 
and 2.9% in objective voice). Shiohara's study tells us that the preference for [meN-V] 
clauses is most dramatic when successive actions by the same agent are involved. It also 
shows that [meN-V] primarily occurs in eventive clauses, contrary to Kaswanti Purwo's 
(1988: 226) claim that they tend to be used for backgrounding. In what follows I show 
that the preference for [meN-V] persists even when there are multiple referents that are 
accorded the status of agent and a switch in the semantic function of the participant is 
involved (cf. Foley & Van Valin 1984). Moreover, [meN-] AT clauses are also used to mark 
the climactic portion of a story episode, unlike in older texts where PT is preferred for 
this purpose. 

Two examples from the novel Lelaki Harimau ‘Man Tiger’ (Kurniawan 2004) show 
how a series of AT clauses can be used to encode a series of actions performed by two 
different participants, with a switch in the semantic function of one participant - from 
patient to agent - but without the accompanying change in the verbal morphology. The 
excerpt in (14) is included here to provide a context for the AT examples in (15) and to 
show the switch from PT to AT. The protagonist, Margio, has just killed the man who 
had courted his mother and made her pregnant but refused to marry her. Margio is subse- 
quently captured for his deed and detained at the local military office. The information 
about his arrest is rendered in PT with T>V order, signalling to the reader to pay at- 
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tention to the event that describes the capture. The referent ‘Margio’ is shown as the 
"topical" patient, as indicated by its pre-predicate position (Cumming 1991: 175). As will 
be further discussed in the next section, PT clauses are commonly used to attract the 
reader's attention to a new focus. 


(14) Kini ketika malam telah runtuh ke bumi mengapungkan bintang-bintang dan bu- 
lan sepotong tergantung enggan, lampu-lampu dinyalakan di pelataran rumah dan 
pinggir jalan, dan codot tak lagi tampak beterbangan disebabkan hitam yang meng- 
hapus hitam tubuhnya, Margio diseret Joni Simbolon ke rayon militer. Selalu 
begitu sebelum seseorang dibawa ke kantor polisi, sebab tanpa itu para prajurit tak 
punya lagi keriangan di dunia republik yang tak ada perang. 


‘Now, when night has fallen on the earth making the stars appear floating and a 
piece of moon is hanging reluctantly, the lights lit on the front yards and along the 
streets, and the bats no longer flying around as the darkness hides the blackness 
of their bodies, Margio is being dragged by Joni Simbolon to the local military 
office. That always happens before someone is taken to the police station, because 
otherwise, the soldiers would no longer be able to have fun in a republic where 
there is no war happening: (Kurniawan 2004: 37; my translation)! 


In the immediately following discourse, shown in (15), the author switches to AT to de- 
scribe two successive events depicting what Margio's captors did to him. In the first 
clause, the agent (Joni Simbolon and his friends) is referred to with the third person plu- 
ral pronoun mereka ‘they’, while the patient (Margio), with the enclitic -nya “3sc’. These 
pronominal forms indicate the presupposed status of the two participants. In the second 
[meN-V] clause, the referent of “Joni Simbolon (and his friends)' is coded as zero (reflect- 
ing its topic status), while ‘Margio’ is mentioned with the enclitic -nya '3sc' again. These 
two [meN-V] clauses are followed by a switch in the semantic function of the patient par- 
ticipant. Margio, coded as zero, is now actor of the intransitive clause meringkuk di satu 
dipan 'curled up on a wooden bed' and agent of the subsequent AT transitive clauses. 


(15) Mereka me-ngurung-nya di dalam sel, meng-ganti pakaian-nya dengan 
3PL MEN-cage-3SG Loc inside cell MEN-change clothes-3sc with 
seragam hitam bau kapur barus dan lemari kayu, me-ringkuk di satu 
uniform black smell chalk barus.tree and wardrobe wood MEN-curl.up LOC one 
dipan meng-hadap-i susu hangat yang tak di-cecap-nya, meng-hadap-i 
wooden.bed MEN-face-APPL milk warm REL NEG DI-taste-356  MEN-face-APPL 
sepiring nasi ikan tongkol yang tak di-lumat-nya. 
one.plate rice fish tuna REL NEG Dr-devour-3sc 
"They; put him; in a cell, [ ]; changed his; clothes into a black uniform smelling 
of mothballs and wooden wardrobe, [ ]; curled up on a wooden bed [ ]; facing (a 


10Tn this translation I provide as close a translation as possible to the original to facilitate the discussion. This 
is slightly different from the published English translation (see Kurniawan 2015; translated by Labodalih 
Sembiring). 
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glass of) warm milk he; didn't drink, [ ]; facing a plate of tuna rice he; didn't 
devour? (Kurniawan 2004: 37; my translation) 


In this excerpt, the referent of 'Margio' is treated as highly continuous; he is mentioned 
with the enclitic -nya '3sc' in the first two clauses, followed by zero in the subsequent 
three (main) clauses. In the final two clauses, the reader is told that Margio is not touch- 
ing the food given to him. The author uses the same predicate in these clauses and also 
relative clauses, creating parallel structures of the kind [V NP yang tak di-V-nya]. As dis- 
cussed later in this chapter, parallelism is an important stylistic resource fiction authors 
often draw on to produce a rhythmic effect. 


(16) a. meng-hadap-i susu hangat yang tak di-cecap-nya 
MEN-face-APPL milk warm REL NEG DI-taste-3SG 
‘face (a glass of) warm milk he didn’t drink’ 
b. meng-hadap-i sepiring nasi ikan tongkol yang tak di-lumat-nya 
MEN-face-APPL one.plate rice fish tuna REL NEG DI-devour-3sG 


‘face a plate of tuna rice that he didn’t devour’ 


Below is a summary of the different clauses and arguments (expressed arguments are 
italicised, while the zero is rendered in square brackets). 


1. Margio; is captured by Joni Simbolon and his friends;. 


N 


. They; put him; in a prison cell. 


LA 


. [ ]j changed his clothes to a prison uniform. 
4. []; curls up on the wooden bed. 
5. [ ]; faces a glass of warm milk he didn't drink. 


6. [ ]; faces a plate of tuna rice he didn't devour. 


We can see in this example that [meN-V] AT clauses are the preferred construction 
for encoding agency even when there are multiple agents and the agency is switched 
between different human referents. In addition, stylistic considerations are also impor- 
tant in the presentation of events. The string of [meN-V] clauses and the repeated [NP 
yang tak di-V-nya] relative clauses create a rhythmic effect and an impression of quick, 
successive actions. Though the last three AT clauses are not foregrounding clauses, the 
rhythmic effect remains and this is due to the repetition of meN- verbs (meringkuk ‘curl 
up’ and menghadapi ‘face something’, the latter being used twice in succession) and the 
relative clauses in (16). 

Another example showing [meN-V] used in succession is given in (17). This example 
describes a physical clash between Margio's sister, Maesa Dewi, and a luminous shadow. 
These two participants are overtly mentioned in the initial AT clause. 
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(17) Maesa Dewi; me-lihat bayangan cemerlang; yang tak di-kenal-i-nya;, 
Maesa Dewi MEN-see shadow brilliant REL NEG DI-recognise-APPL-3SG 
me-mancar mem-beri silau kepada mata-nya, men-desak-nya; ke belakang 
MEN-shine MEN-give blinding toward eye-3s6 ` MEN-force-3sG to back 
dan suatu benturan meng-hantam kepala-nya; saat itu []; me-nentang 
and INDEF knock  MEN-whack ` head-ss6 moment that MEN-hit 
dinding pintu di belakang, []; meng-henti-kan-nya; sejenak sebelum []; 
wall door roc back MEN-stop-APPL-3SG for.a.moment before 
me-rosot dan tumbang ke lantai. 
MEN-slide.down and collapse to floor 


‘Maesa Dewi; saw a luminous shadow;, which she; did not recognise, [ Jj shone 


brightly, giving her eyes a blinding light, [ ]; pushed her; backwards and a knock 
hit her head at that moment (until) [ ]; knocked the back door, [ ]; stopped her 
momentarily before [ ]; slid down and [ ]; fell to the floor" (Kurniawan 2004: 34) 


The chain of events with the different participants and their semantic roles is summarised 
below (as previously, expressed participants are shown in italics, and the zero is indicated 
as square brackets). 


1. Maesa Dewi; (Agent) saw a luminous shadow; (Patient). 
2. [ ]; (Agent) shone brightly, giving her; eyes (Patient) a blinding light. 
3. [ ]; (Agent) pushed her; (Patient) backwards. 


4. [ ]; (Agent) knocked the back door (Patient). 


on 


. [ ]; (Agent) stopped her; (Patient) momentarily. 
6. [ ]i (Undergoer) slid down. 


7. [ ]; (Undergoer) fell to the floor. (note: this event is encoded with a bare verb.) 


With the exception of the first two clauses, the clauses in this example are eventive, as 
indicated by the punctual verbs. The referent of ‘Maesa Dewi’ is presented as agent in 
the initial clause, then patient and undergoer in the following clauses. The use of zero in 
subject position throughout this long example potentially creates referential confusion 
and slows down processing: is it the shadow or Maesa Dewi that slid down and collapsed? 
However, this potential confusion is resolved by the discourse context. In the subsequent 
clauses, Maesa Dewi is described as lying down as if frozen, unable to speak and her 
memory hazy. All the clauses in this story episode are rendered in [meN-V] AT clauses 
and most of them denote quick, successive actions by different agents/actors. In this 
example, as with example (15), the repetition of [meN-V] produces vividness and a sense 
of immediacy as well as a rhythmic effect. It seems that for the author, creating this 
kind of effect is more important in this long sentence than avoiding potential referential 
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confusion. These examples provide evidence that in contemporary Indonesian, [meN-V] 
serves the kind of pragmatic function previously encoded with [di-V-nya]. 

The next two examples are from television news reports aired in 2015 and 2017, show- 
ing [meN-V] AT clauses marking the climactic portion of a story episode. In these ex- 
amples, a PT clause with [di-V] is used to mark an event that leads to the climax and 
one subsequent to it. The first example of [meN-V] occurs in (19). This example follows 
directly from the sentence in example (2), repeated below as (18) for convenience. When 
this news item was broadcast, the sentence was read like a headline about a husband and 
wife couple being attacked by people in a motorbike. This headline is rendered in PT. 


(18) Sepasang suami istri yang sedang pergi ke tahlilan keluarga di Cirebon 
SE-couple husband wife REL in.process go to tahlilah family Loc Cirebon 
di-serang oleh geng motor. 
pi-attack by gang motor 


‘A husband and wife, who went to a family tahlilan event in Cirebon, were 
attacked by a motorcycle gang. (Fokus Pagi, Indosiar, 9/11/2015) 


Following the headline, a series of chronological events were reported, as shown in (19). 
The first clause contains a [di-V] PT clause with pre-predicate patient, the second has 
a [meN-V] clause with zero agent, and the third, another [di-V] clause also with pre- 
predicate patient. The agentive [meN-V] clause marks the climax of the story, while the 
first and third PT clauses inform the addressee of the event that leads to and follows this 
climax respectively. The pre-predicate patient (pasangan suami istri itu ‘the husband and 
wife couple’) in the PT clauses indicates its “topical” status. 


(19) Naas pasangan suami istri itu di-pepet oleh kendaraan 
bad.luck couple husband wife that pr-press.against by vehicle 
bermotor lalu me-nyerang dengan senjata tajam, kemudian sang istri 
have.engine then MEN-attack with arm sharp then DEF wife 
di-tusuk. (Lit. 'Bad luck, the husband and wife couple was pressed against by a 
DI-stab 
motorbike then attack with a sharp object, after which the wife was stabbed’) 


“Bad luck for the husband and wife couple;, a motorbike; approached them; and 
[ ]j attacked [ ]; with a sharp object, then stabbed the wife.’ 
(Fokus Pagi, Indosiar, 9/11/2015). 


A similar example is shown in (20) but with the post-climactic event rendered in an 
intransitive clause marked by the prefix ber-. This example is taken from a news item 
about a crime committed against Novel Baswedan, a senior investigator for Indonesia's 
Komisi Pemberantasan Korupsi ‘Commission for the Eradication of Corruption’. Novel 
was reported as walking home from the nearby mosque at five am when two people in 
a motorbike rode towards him and threw acid on his face. The initial [di-V] PT clause 


1 Tahlilan is a Muslim social gathering to commemorate the dead. Pak is a short form of bapak, a kin term 
used to address or refer to one's father and adult males in general. 
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alerts the addressee to new information, while the [meN-V] marks the eventive clause 
and the climax of the story.12 The pre-predicate patient in the PT clause indicates the 


patient's "topical" status. The following [ber-V] clause describes what happens after the 
climax. 


(20) Tiba-tiba Novel di-datang-i dua orang bersepeda motor dan langsung 
suddenly-rpp Novel pi-come-APPL two person on.bicyle motor and immediately 
me-nyiram-nya dengan air keras. Novel lalu berteriak dengan 
MEN-pour-3sa with water hard Novelthenscream with 
sekencang-kencangnya. 
as.loud.as.possible-RDP 


‘Suddenly Novel; was approached by two people; on a motorbike and 
[ ]j threw acid on bim, Novel screamed as loudly as he could. (Eksklusif: 
Wawancara Novel Baswedan, Aiman, Kompas TV, 3 July 2017) 


To summarise, this section has discussed, first, the use of [meN-V] clauses to encode 
a series of events involving different agents. In the examples shown, vividness and a 
sense of immediacy are created through successive use of [meN-V] in both transitive 
and intransitive clauses involving different referents in subject position (in the role of 
either agent, actor or undergoer). I discussed two examples from the same novel, so 
one might argue that the preference for [meN-V] clauses is particularised to the author 
of that novel Oe, Eka Kurniawan). However, as Cumming (1991) and Kaswanti Purwo 
(1988) have amply shown through their examples, the use of [meN-V] or [di-V-(nya)] 
clauses to denote successive actions is common in Indonesian fiction. What seems to 
be a new development is the strong preference for AT even when agency is switched 
among participants. This section has also shown that [meN-V] clauses can mark the 
climactic portion of a story episode, with non-eventive clauses marked by [di-V] and 
[ber-V] preceding and following them. This is the opposite of the kind of pattern in 
older texts studied by Cumming and Kaswanti Purwo, where PT is the preferred clause 
type for marking climax and AT for backgrounding. 


5 Di-V-nya and topic-focus distinction 


Some have argued that [di-V-nya] clauses with post-predicate patient are ergative (see 
Hopper 1979: 232-233; Arka & Manning 1998: 14). However, others disagree (e.g., Cum- 
ming & Wouk 1987; Cumming 1991: 33-34; Kroeger 2014). The debate will not be entered 
into here; in this study I follow Cumming (1991) in considering [di-V-nya] as a type of 
PT clause. One of the common environments in which [di-V-nya] appears in Indonesian 
is where it is preceded and followed by AT clauses. A switch from AT clauses to a [di- 
V-nya] clause signals a shift to a new focus and serves as an invitation to the addressee/ 
reader to take notice. This is exemplified in (21). 


Tn this example, the verb contains the applicative suffix —i. As mentioned, this is treated as [di-V]. 
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(21) Almashira; mem-buka gorden biru muda bermotif ^ vineyard, me-natap-i 
Almashira MEN-open curtain blue light have.pattern vineyard MEN-gaze-APPL 
tetes hujan deras yang mem-basah-i jendela kamar-nya yang ter-letak 
drop rain hard REL MEN-wet-APPL window bedroom-3sc REL TER-position 


di tempat yang paling private di rumah-nya, alias di loteng. 
Loc place REL most private Loc house-3sc none.other.than Loc upstairs 
Di-ambil-nya; sweter-nya, yang gombrong dan nyaman, lalu 

DI-get-3sG sweater-3sc REL loose and comfortable then 


me-makai-kan-nya; di tubuh-nya yang langsing. 

MEN-put.on-APPL-3sG LOC body-3sc REL slim 

"Almashira,; opened the light blue curtain with a vineyard motif, stared at the 
raindrops falling on the window of her bedroom which is located in the most 
private section of her house, namely upstairs. She; grabbed her loose and 
comfortable sweater;, and put it; on her slim body. (Karina 2008: 7) 


Here, the initial clause describes the protagonist Almashira opening the curtain in her 
bedroom and subsequently staring at the raindrops falling on her window. Almashira 
is the agent in the AT clauses in this sentence. The first clause (referring to Almashira 
opening the curtain) is eventive, while the second (Almashira staring at the raindrops) 
is a process (descriptive). In the second sentence, a switch to PT with di-V-nya occurs. 
This switch marks a shift from a non-eventive clause in the previous sentence (Almashira 
staring at the raindrops) to an eventive one (Almashira fetching her sweater). The switch 
to [di-V-nya] also coincides with a new sentence. The [di-V-nya] clause is then followed 
by a return to AT. In this AT clause, the agent, which is mentioned in the PT clause 
with the clitic -nya ‘3sc’, is now rendered with zero, indicating the referent's status as 
topic. Meanwhile, the patient argument (the referent of sweternya ‘her sweater’) is now 
rendered also with the clitic -nya ‘3sc’. 

Cumming (1991: 175) suggests that a switch to PT is usually associated with the "climac- 
tic portion" of a story episode and creates a feeling of vividness or immediacy. However, 
this argument does not apply to (21). The excerpt is taken from the opening paragraph 
of the novel Circa (Karina 2008) and this is the first time that the reader is introduced 
to the protagonist and her actions. The switch from a descriptive clause (Almashira star- 
ing at the rain drops) to the [di-V-nya] clause (Almashira fetching her sweater) draws 
our attention to new information but does not indicate a climactic point in the episode. 
Moreover, the patient cannot be considered topical either as it has not been mentioned 
previously. The PT clause merely introduces a new event that contrasts from the descrip- 
tive clause in the previous sentence. Once this event has been introduced, a switch back 
to AT with zero agent follows. 

Compare (21) with the example below from the novel Senja di Jakarta by Mochtar 
Lubis published in 1970 (quoted without gloss in Cumming 1991: 174) where [di-V-nya] 
marks the climactic portion of the story episode. Here Dahlia's actions of arranging Has- 
nah's hair, taking a lipstick from her bag and applying it on Hasnah's lips form successive 
actions and all are rendered in PT. The beginning and end of the episode are rendered in 
AT. 
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(22) Dahlia memegang rambut Hasnah, mengambil sisir, dan asyik mengerjakan ram- 
but Hasnah. Mula-mula Hasnah membantah, akan tetapi tidak diperdulikan Dahlia. 
Setelah rambut Hasnah disusunnya, diambilnya cat bibir dari tasnya, dan di- 
gincunya bibir Hasnah. Dia mengambil kaca dari dinding, memegang kaca di 
depan Hasnah, dan berkata, "Nah, lihat, kan cantik?" 


"Dahlia grasped Hasnah's hair, took a comb, and zealously did Hasnah's hair. At 
first Hasnah rebelled, but (she) was ignored by Dahlia. After she had arranged 
Hasnah's hair, she took a lipstick from her bag and painted Hasnah's lips. She 
took a mirror from the wall, held the mirror before Hasnah, and said, “Now, look, 
aren't you pretty?” 


Although the beginning and end of example (21) are also marked by AT, the sense of 
immediacy generated by the series of quick, successive actions in (22) is largely absent. 
In (21), the use of [di-V-nya] merely signals to the reader that the protagonist is now 
performing an action that contrasts with the one before. I would argue that [di-V-nya] 
with post-predicate patient in both examples marks a new focus. By using this clause 
structure, the author is making a pragmatic assertion (about what action the agent is 
performing and in relation to what kind of patient referent), expecting the reader to 
know what the proposition is only after reading the clause. This assertion cannot be 
presupposed; so, for example, in (21) Almashira fetching her sweater cannot be presup- 
posed after reading about her opening the window and staring at the raindrops. Using 
[di-V-nya] with V>T order is a grammatically effective way of alerting the reader to the 
new information. 

In a similar example, shown earlier in (5) and repeated below as (23) with a fuller 
context, a switch from AT to PT with V>T order is found. Here a series of AT clauses is 
followed by a series of PT clauses. As with the previous examples, the switch from the 
AT to PT (the PT clauses are underlined in the English translation) does not mark the 
climactic part of a story episode but rather, a shift in focus. 


(23) Rosna me-langkah ke dalam, mem-buka pintu tengah, lalu pergi menuju 
Rosna MEN-step to inside MEN-open door middle then go toward 
kamar mandi setelah ia meng-ambil handuk yang ter-gantung di sampiran. 
room bath after 3sGMEN-fetch towel REL TER-hang Loc clothesline 
Di-cuci-nya muka-nya, kemudian di-lap-nya kering-kering, lalu kembali ke 
Di-wash-3sc face-3sc then DI-wipe-3sc dryly then return to 
kamar-nya. 
room-3sG 
‘Rosna stepped inside, opened the door in the middle (of the room), and then 
headed towards the bathroom after she fetched a towel that was hanging on the 
rail. She washed her face;, then she wiped [ ]; dry, after which [ ]; returned to 
her room: (Krisna 1977: 119) 


Here, Rosna is described as performing a series of actions (stepping inside, opening 
the middle door, fetching a towel and going towards the bathroom), and these actions 
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are rendered in AT clauses with meN- verbs and an intransitive clause with a bare verb 
(pergi menuju kamar mandi “headed towards the bathroom"). The switch to the [di-V- 
nya] PT clause marks the beginning of a different series of actions (i.e., washing and 
drying the face). These are actions for the purpose of which the preceding series of ac- 
tions are taken (i.e., Rosna walking to the bathroom and fetching a towel on the way 
are performed for the purpose of washing and drying her face). It is in this way that 
we can understand the PT clauses as being narratively significant. In the first PT clause, 
the patient mukanya 'her face' cannot be presupposed (Rosna could have washed her 
hands rather than her face, for instance). Furthermore, the actions denoted by the PT 
clauses, although significant, cannot be considered climactic. The excerpt describes the 
protagonist Rosna performing the series of actions in preparation to speak to Benda, the 
man she has been wishing to date. Although the actions of washing and drying the face 
are significant within the context of the episode where Rosna is concerned to making 
herself presentable to Benda, they are only part of the series of events that lead to the 
climax. Later in the episode, the reader is told that Benda, who is waiting for her in the 
living room while Rosna freshens herself, decides to leave suddenly and not engage ro- 
mantically with her, leaving Rosna deeply disappointed. It is that part of the episode in 
my view that marks the climactic portion. 

It is possible to render the focal action(s) rendered in PT in (21) and (23) as AT clauses, 
as shown in the reconstructed examples in (24) and (25), but these clauses merely denote 
a series of chronological events rather than signal a shift to a different event (or a new 
series of events) the author wants the reader to pay particular attention to. 


(24) Ia mengambil sweter-nya; yang gombrong dan nyaman, lalu 
3sG MEN-take — sweater-3sc REL loose and comfortable then 
me-makai-kan-nya; di tubuh-nya yang langsing. 
MEN-put.on-APPL-3sG LOC body-3sc REL slim 
‘She grabbed her loose and comfortable sweater, and put it on her slim body’ 
(adapted from Karina 2008: 7) 


(25) Ia men-cuci muka-nya, kemudian me-lap kering-kering, lalu kembali 
3sG MEN-wash face-3sG then MEN-wipe-3sc dryly-àpP ` then return 
ke kamar-nya. 
to room-3SG 


‘She; washed her face;, then wiped [ ]; dry, after which [ ]; returned to her room: 
(adapted from Krisna 1977: 119) 


In the original examples we saw earlier in (23), it is the entire PT clauses, and not indi- 
vidual elements within them, that are treated as focus. This exemplifies what Lambrecht 
(1994: 233) calls "sentence focus", the type of focus where the scope extends to the entire 
clause, and not only the predicate or an argument. To explain how the focus is deter- 
mined, let us turn to example (2) again, repeated below without the gloss. 
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(26) Almashira; membuka gorden biru muda bermotif vineyard, [ ]; menatapi tetes 

hujan deras yang membasahi jendela kamarnya yang terletak di tempat yang paling 
private di rumahnya, alias di loteng. Diambilnya; sweternya yang gombrong dan 
nyaman, lalu [ ]; memakaikannya; di tubuhnya; yang langsing. 
"Almashira; opened the light blue curtain which a vineyard motif, [ ]; gazed at 
the drops of water coming from the pouring rain that is wetting her bedroom 
window which is located in the most secluded part of her house, namely upstairs. 
She grabbed her loose and comfortable sweater, then [ ]; put it; on her; slim body’ 
(Karina 2008: 7) 


Here we have the protagonist, Almashira, mentioned in the initial clause as opening the 
curtain in her bedroom. This clause is rendered in a [meN-V] AT clause. The referent 
of the name Almashira is rendered as zero in the following [meN-V] clause, indicating 
that it is now treated as topic. In the [di-V-nya] PT clause that follows, the referent is 
referred to with enclitic —nya, while in the subsequent “resumptive” [meN-V] AT clause, 
itis rendered as zero, once again indicating her status as topic. In this example, the entire 
PT clause is the focus because the event denoted by it adds new information. The patient 
sweternya yang gombrong dan nyaman 'her loose and comfortable sweater' cannot be the 
only focal element although it adds new information to the presupposition 'the person 
fetched x’, where ‘x = sweternya yang gombrong dan nyaman’, because saying so does 
not explain why [di-V-nya] occurs, and why the patient is in post-predicate position 
(V>T order). It is grammatically and semantically acceptable to have the patient placed 
in pre-predicate position: Sweternya yang gombrong dan nyaman dipakainya ‘She put on 
her loose and comfortable sweater’ (T>V order). However, the switch to the PT clause 
puts the highlight not only on the sweater but also on the fact that Almashira is grabbing 
it and putting it on, thus it is the event that is focal, not only the patient argument. The 
PT clause is a grammatical device that informs the reader to direct their attention to a 
new focus. In this sense, it is similar in function to presentatives marked by particle lah 
in Classical Malay (see Cumming 1991: 90 for examples). Cumming points out that the 
use of lah to mark presentativeness is greatly diminished in Indonesian. It could be that 
switching from AT clauses to a PT clause is one mechanism that Indonesian writers use 
for this purpose. 

The focal status of the PT clause can be shown by using an adapted version of the tests 
in Lambrecht (1994: 223). A focus element can occur on its own as an open proposition. 
Thus below, the entire PT clause can occur as the open proposition in (27a), but the 
predicate alone cannot do so, as shown in (27b). 


(27) a. Sentence-focus 
Q: Apa yang terjadi? “What happened?’ 
A: Diambilnya sweternya. “She grabbed her sweater: 
b. Predicate-focus 
Q: Apa yang terjadi pada sweternya? “What happened to her sweater?’ 
A: DIAMBILNYA sweternya. ‘She grabbed: 
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Applying the same tests to the first PT clause in example (5) and (23), repeated below 
without the gloss, the open proposition in (29a) makes sense, while (29b) and (29c) which 
put the stress on dicucinya ‘she washed’ and mukanya ‘her face’ respectively, are awk- 
ward. The proposition in (29b) would be more appropriately responded to with an AT 
clause (Dia MENCUCI mukanya ‘She WASHED her face’); similarly, a more suitable re- 
sponse to (29c) would be to front the patient (MUKANYA yang dicucinya 'It is HER FACE 
that she washed"). This suggests that the focus ofthe first PT clause extends to the entire 
clause rather than being limited to the predicate or the patient argument. 


(28) Dicucinya; mukanya;, kemudian dilapnya; kering-kering, lalu kembali ke kamar- 
nya. 
‘She; washed her face, she; wiped [ J; dry, then [ ]; returned to her room’ (Krisna 
1977: 119) 


(29) a. Sentence-focus 
Q: Apa yang terjadi? “What happened?’ 
A: Dicucinya mukanya. “She washed her face? 
b. Predicate-focus 
Q: Apa yang dilakukannya? “What did she do?’ 
A: DicUCINYA mukanya. “SHE WASHED her face: 
c. Argument-focus 
Q: Apa yang dilakukannya? ‘What did she do?’ 
A: Dicucinya MUKANYA. ‘She washed HER FACE. 


The question that remains now is: what is the status of the second di-V-nya clause (dilap- 
nya kering-kering ‘she wiped it dry’)? I propose that in this case too, the focus extends 
to the entire clause. Here, the patient in the preceding clause (mukanya ‘her face’) is 
now presupposed and occurs as zero, indicating its topic status. If it were expressed, 
the clause would appear as dilapnya mukanya kering-kering or dilapnya kering-kering 
mukanya, both meaning ‘she wiped her face dry’. 

The topic status of the patient mukanya ‘her face’ can also be shown, for example, by 
applying a combination of left- and right-dislocation test (cf. Gundel & Fretheim 2004: 
186). Gundel & Fretheim (2004: 186) state that “the structure most widely and consistently 
associated with topic marking is one in which the constituent referring to the topic of the 
sentence is adjoined to the left or right of a full sentence comment/focus”. I apply a com- 
bination of left- and right-dislocation to test the topic status of the referent of mukanya 
‘her face’. The resulting sentence is well-formed though, I submit, it would be more likely 
to occur in spoken language rather than written language (as Gundel & Fretheim (2004: 
188) also note with regard to right dislocation construction in Norwegian). 


(30) a. Dilapnya kering-kering. 
‘She wiped [it] dry’ 
b. Kalau mukanya, dilapnya kering-kering, mukanya itu. 
“As for her face, she wiped it dry, that face of hers? 
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One might argue that the topic in the clause dilapnya kering-kering is the referent of 
the enclitic -nya '3sc' Ge, the referent of ‘Rosna’), and not the referent of mukanya ‘her 
face’. However, as mentioned, di-V-nya is conventional in the sense that this structure 
presupposes an action performed by some third person agent, therefore -nya '3sc' is 
presupposed as its structural element, not as the topic of the PT clause. The same left- 
/right-dislocation topic test therefore does not apply. The resulting sentence below is 
nonsensical (note: in the test below, the enclitic must be rendered as a free pronoun in 
the ‘as for’ phrase for well-formedness). 


(31) a. Dilapnya kering-kering. 
‘She wiped [it] dry: 
b. Kalau dia, dilapnya kering-kering, dia itu. 
‘As for her, she wiped [ ] dry, she is. 


Treating [di-V-nya] clauses with V>T order (i.e., PT clauses with post-predicate patient) 
in terms of sentence focus sits well with Hopper's (1979) argument that this structure 
marks foregrounding in Classical Malay. The series of [di-V-nya] clauses with V>T order 
in (28) and also those in (22) denote focal events that move the stories along the temporal 
axis. By using a series of [di-V-nya] clauses the authors invite the reader to pay particular 
attention not to a single new event in the narrative, as is the case with a switch from AT 
clauses to a single PT clause, but rather to sustain their attention throughout a series of 
events. To strengthen this argument we can compare with a switch from AT clauses to a 
single PT clause in (32). This example describes a young man feeling shocked at seeing 
a torn picture of an ex-girlfriend who had left him. The initial main clause contains the 
serial verbs tertegun melihat ‘stunned in seeing’. The [di-V-nya] PT clause marks a shift 
from these process verbs to an event (picking up the picture). The return to an AT clause 
after the PT clause marks a return to a process (the agent staring at the picture). (Note: 
as indicated in the subscript, the referent of —nya in dipungutnya “he picked up' is not 
coreferential with that in menatapinya “he stared at it' in the following clause.) The PT 
clause marks an event, while the AT clauses occur as descriptive clauses. 


(32) Cowok berparas kutu buku ini tertegun me-lihat salah.satu potongan 
guy have.the.look lice book this stunned MEN-see oneof cut 
gambar yang terasa begitu familiar di ingatan-nya. Perlahan di-pungut-nya; 
picture REL feel so familiar Loc memory.3sG slowly pr-pick.up-3sG 
ujung kertas itu, me-natap-i-nya; lama dengan berbagai luapan 
corner paper that MEN-stare.at-APPL-3sG long with ^ various explosion 
emosi bermain di refleksi ^ mata-nya yang nanar. 
emotion play Loc reflection eye-3s6 REL wild 


"Ihis guy with the bookish face was stunned (at) seeing a piece of a picture 
which felt so familiar in his memory. Slowly he picked up the corner of the 
paper, [ ] stared at it for a long time with all sorts of emotions reflecting in his 
wild eyes: (Karina 2008: 9) 
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Here, as in (28), the PT clause dipungutnya ujung kertas itu ‘he picked up the corner 
of the paper’ is focal and its domain spans the entire clause. In the “resumptive” AT 
clause, the agent occurs as zero while the patient is expressed as -nya '3sc'. However, 
this AT clause with zero agent sounds rather awkward, but this is possibly due to the 
fact that the situation is narrated in the third person but incorporates the point of view 
of the character. The relative clause yang terasa begitu familiar di ingatannya ‘which 
feels so familiar in his mind' is told from the point of the view of the narrator but the 
situation is viewed from the perspective of the character (as indicated by the scalar/ 
intensifier adjective begitu familiar ‘so familiar’). This technique of presenting point of 
view, known as 'free indirect discourse' (see e.g., Cohn 1978; Fludernik 1993; Maier 2014), 
is widely used in fictional texts. 

To summarise, I have argued in this section that the switch from an AT clause to a 
[di-V-nya] PT clause marks a shift in focus, signalling a new event (or a series of new 
events) that contrasts with what occurs before. The next section shows that [di-V-nya] 
is not the only PT form used for this purpose. The OV structure [ia V] serves a similar 
function but with an added pragmatic function of creating an impression of structural 
symmetry. 


6 PT clauses with ia, repetition, and parallelism 


In this section, I consider the interaction between OV clauses with the free third person 
pronoun ia '3sc', [ia V], and AT clauses containing the same pronoun [ia meN-V] to 
show that, similar to the switch from [meN-V] AT clauses to a [di-V-nya] clause (or a 
series of clauses), the switch from [ia meN-V] to [ia V] marks a shift of focus. The use 
of [ia V] also serves an additional function, namely producing a “synchronising” effect 
between the OV and AT clauses through an appearance of structural similarity. In both 
of these clause types, the agent ia “3sc’ is in pre-predicate position. When ia ‘3sc’, either 
in [ia V] or [ia meN-V], is repeated across clauses, the repetition creates an impression 
of parallel structures. 

Consider example (33). Here Alma, the sister of the guy with the bookish face men- 
tioned in (32), is described as sympathetic to her brother's situation and doing her best 
to console him. An AT clause with [ia meN-V] occurs in the first sentence, followed by 
[ia V] OV clause in the first clause ofthe second sentence. Subsequent clauses in this sen- 
tence are rendered in [meN-V] once again but with zero agent. In the previous examples 
we saw a PT clause being preceded and followed by AT clauses; here we see a similar 
pattern of AT clauses preceding and following an OV clause, marking the OV clause as 
focal. 


(33) Sesaat ia juga merasakan emosi yang sama. Dengan perasaan tak menentu, ia 
palingkan wajahnya, merenung, lalu kembali menepuk punggung abangnya 
dengan ceria. 


‘For a moment she; also felt the same emotion. Feeling uncertain, she; turned 
away, [ ]; thought for a while, and then [ ]; tapped her brother on the back cheer- 
fully? (Karina 2008: 9) 
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The referent of ia in the OV clause ia palingkan wajahnya ‘lit. she turned her face away’ 
is coreferential with ia in the preceding sentence ia juga merasakan emosi yang sama 
‘she also felt the same emotion’; in both cases the referent is presupposed and treated 
as topic. In this clause, it is the entire clause ia palingkan wajahnya that constitutes the 
focus. The NP patient wajahnya ‘her face’ cannot, on its own, be treated as the focus 
because palingkan ‘turn away’ and wajahnya ‘her face’ are in relations of entailment 
(memalingkan ‘turn away’ entails memalingkan wajah ‘turning (one's) face away’). Thus 
in the test for focus shown in (34), interpretation (34a) makes sense, while (34b) is odd. 


(34) la paling-kan wajah-nya. 
3sG turn.away-APPL face-3sG 
‘She turned away: 


a. PRAGMATIC PRESUPPOSITION: ‘the person did x’ 
PRAGMATIC ASSERTION: “x = turn away’ 


b. PRAGMATIC PRESUPPOSITION: ‘the person turned away x’ 
PRAGMATIC ASSERTION: “x = her face’ 


The occurrence of the free pronoun ia '3sc' in the AT and OV clauses is significant not 
only because the pronoun is the controller for the zero in the following [meN-V] clauses, 
but importantly, because the repetition of this pronoun in the different clause structures 
creates “stylistic alignment" through structural parallelism. Repetition, as Tannen ar- 
gues, is a common device for promoting a heightened sense of involvement in spoken 
and written discourse, and “rhythmic synchrony” is an important and widespread fea- 
ture in both conversational and written fictional discourse (Tannen 2007: 32). Repetition 
and parallel structures in texts also aid processing (Carlson 2002). By using the [ia V] OV 
clause rather than [di-V-nya] PT clause, the author thus achieves several goals simultane- 
ously: signalling a new focus, creating a rhythmic effect and thus promoting heightened 
involvement by stylistically aligning the OV clause with the preceding and following AT 
clauses, and highlighting the continuity of the third person agent. The series of clauses 
in this example are reproduced below. 


(35) | ia; merasakan emosi yang sama ‘she; felt the same emotion’ 
ia; palingkan wajahnya ‘she; turned her face away’ 
[ ]; merenung ‘[ ]; thought for a while’ 


[ ]; menepuk punggung abangnya ` '[ ]; tapped her brother's back’ 


It is important to mention here that repetition and parallel structures can be achieved 
not only by aligning [ia meN-V] with [ia V]. In the earlier examples, we saw that the 
repetition of [meN-V] AT clauses and [di-V-nya] PT clauses to denote a series of events 
by the same agent can also produce a similar effect. What is interesting about [ia V] 
clause in (33) is that, the effect is achieved through an "impression of sameness" between 
this OV clause and the [ia meN-V] AT clause. This sameness is created by the repetition 
of the free pronoun ia '3sc' in pre-predicate position. While multiple occurrences of the 
same clause structure denote a series of actions performed in quick succession by the 
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same agent (Cumming 1991: 174; also see Kaswanti Purwo 1988: 225), the pragmatic and 
stylistic effects that result from the use of different clause structures, to my knowledge, 
have not been discussed. Taking these effects into account helps us understand why 
different grammatical structures are chosen, not only what types of events they encode. 

Both Cumming (1991: 175-178) and Kaswanti Purwo (1988: 222-228) have noted that 
the position of the patient in PT clauses tells us something about its information status. 
The patient in [ia V] OV clauses can occur in post-predicate position, as seen earlier in 
(33), or in pre-predicate position, as in (7). According to Kaswanti Purwo (1988: 222), one 
of the environments in which pre-predicate patient can occur is where there is a gap 
between the last mention of the NP and the current mention.’ Cumming (1991: 177) also 
notes that the patient in this position is treated as "topical", i.e., the referent has been 
mentioned *in the immediately previous clause or several clauses back, and is of some 
importance" (1991: 179). Both of these observations are borne out in an example from the 
acclaimed novel Cantik itu Luka ‘Beauty is a Wound’ by Eka Kurniawan, given below. I 
include the excerpt in (36) to give a context for the OV clause V>T order in (37). 

The excerpt begins with the description of Dewi Ayu, the protagonist, waking early 
to go to the toilet. A wealthy woman who made her money from working as a prostitute, 
Dewi Ayu was taken to a prison camp along with others from her village when the 
Japanese came to Indonesia. Not wanting to lose her precious jewellery, she decided 
that the best way to safeguard it was to swallow some of the pieces (six gold rings) and 
she buried the rest in the broken toilet at the back of her house. She did this before 
leaving for the camp. At the camp, she safeguards the rings by retrieving them from her 
faeces when she defecates and swallowing them again afterwards. The [ia meN-V] AT 
transitive clauses occur twice. 


(36) Dewi Ayu yang terbiasa bangun pagi sekali untuk buang air segera bergegas ke 
toilet, namun antrian panjang telah menunggu. [....] Cara terbaik adalah mengambil 
air dengan kaleng margarin Blue Band-nya, dan pergi ke halaman belakang sel. Di 
sana, di antara pohon ketela yang entah ditanam siapa, ia menggali tanah seperti 
seekor kucing, dan berak di lubangnya. Setelah cebok dengan menyisakan sedikit air, 
ia mengorek tainya untuk menemukan keenam cincinnya. Beberapa perempuan 
lain melihat cara beraknya yang buruk, dan menirunya dalam jarak yang cukup 
berjauhan; mereka tak tahu ia punya harta karun. 


“Dewi Ayu, who is used to getting up very early to pass stool, quickly rushed to 
the toilet, but a long queue was already waiting [....] The best alternative is (for her 
to) get water using the Blue Band margarine tin, and go to the yard behind the cell. 
Over there, between cassava trees planted by some unknown person, she dug the 
ground like a cat, and defecated in the hole. After rinsing herself and leaving some 
water aside, she poked through her faeces to find her six rings. Some women saw 


P According to Kaswanti Purwo (1988: 222), the other two environments in which the patient occurs pre- 
verbally are a) when two NPs are contrasted and b) when an NP is promoted from oblique position to 
argument position. 
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how bad her manner of passing stool was, and [ ] imitated her from a distance; 
they didn't know she had some treasure’ (Kurniawan 2002: 67; my translation) 


Immediately following (36) is the sentence with an OV clause and pre-predicate agent, 
shown in (37). This clause is followed by another a [meN-V] AT clause but with zero 
agent. 


(37) Cincin-cincin tersebut ia; cuci dengan sisa air, dan 
ring-RDP aforementioned 3sc wash with leftover water and 
me-nelan-nya; kembali. 

MEN-swallow-3sc again 


‘She washed the rings with the remaining water, and [ ]; swallowed them; 
again: (Kurniawan 2002: 67) 


In the OV clause, the pre-predicate patient cincin-cincin tersebut “the rings' is the topic. 
It occurs as a definite NP and the referent is presupposed, having been mentioned in the 
previous discourse. Its topic status can be shown for example, by applying the same left- 
and right-dislocation test from Gundel & Fretheim (2004) that we applied to the previous 
examples. 


(38) Kalau cincin-cincin tersebut, ia cuci dengan sisa air, cincin-cincin itu. 


‘As for the rings, she rinsed (them) with the remaining water, those rings: 


The OV predicate ia cuci dengan sisa air is focal in relation to the topic expression cincin- 
cincin tersebut ‘the rings’. The test below shows that the predicate can stand alone as 
an open proposition in answer A, whereas the [ia V] phrase in answer B is slightly odd 
when it occurs on its own as a response to the question Q. 


(39 Q: Apa yang terjadi pada cincin-cincin tersebut? 
"What happened to the rings?’ 
A: Ia cuci dengan sisa air. 
‘She washed (them) with the remaining water: 
B: a cuci. 
‘She washed (them): 


This analysis supports the idea that the relation between topic and focus is in some 
respects like that between "topic" and “comment” (Gundel & Fretheim 2004: 175) in topic- 
comment constructions. The [ia V] OV clause is similar to topic-comment in that the 
pre-predicate patient (cincin-cincin tersebut ‘the rings’) is the topic, and the elements to 
its right (ia cuci dengan sisa air ‘she washed with the remaining water’) are a comment 
about the topic. Also the pre-predicate patient is "topical" in Cumming's sense in that the 
referent is accorded some importance in the episode, and this is reflected in its syntactic 
position. As indicated in (36), the six rings are significant in relation to the protagonist; 
they are valuable belongings the protagonist wants to protect. They are also important in 
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the subsequent episode where the reader is told that the rings saved Dewi Ayu financially. 
After being released from the camp, she was able to survive by pawning them and the 
rest of the jewellery she could retrieve. 

The “topical” status of the patient referent (in Cumming’s sense) can also be inferred 
from the previous discourse and does not need to have been explicitly mentioned in 
prior discourse. The excerpt in (40) describes the character Kliwon tidying up his room 
in preparation for leaving home. The example is taken from an episode where Kliwon is 
depicted as a young, budding left-wing activist about to leave his hometown to take up 
university study in Jakarta. Among his belongings are books that are scattered around 
his room. The pre-predicate patient buku-buku ‘books’ is the topic of the second sentence. 
The referent of this NP is presupposed by virtue of its association with the patient NP 
in the preceding sentence (semua barang-barangnya ‘all his belongings’) and inferable 
from a mention in prior discourse where the reader learns that Kliwon has been given a 
book by his socialist mentor. 


(40) la telah me-ngemas semua barang-barang-nya, dengan harapan jika ia pergi 
3SG PERF MEN-pack all belonging-RDP-3sG with hope if 3sc go 
me-ninggalkan-nya, semua dalam keadaan | rapi. Buku-buku yang semula 
MEN-leave-3sc/PL all roc condition neat book-RpP REL previously 
berserakan di atas tempat tidur, meja dan lantai ia masukkan ke dalam kotak 
scattered Loc top place sleep table and floor 3sc putin to inside box 
kardus dan me-numpuk-nya rapi di sudut kamar. 
cardboard and MEN-stack.up-3sG/PL neat LOC corner room 


“He has packed up all his belongings, with the hope that if he has to go away 
leaving them behind, all would be in a neat condition. He put the books which 
were lying on the bed, table and floor in cardboard boxes and [ ] stacked them up 
neatly in the corner of the room’ (Kurniawan 2002: 195-196) 


The example begins with an eventive AT clause containing completive marker telah. Two 
successive events follow, encoded by eventive OV clause with pre-predicate patient and 
eventive AT clause respectively. The pre-predicate position of the patient buku-buku 
‘books’ suggests that the switch to OV is made mainly to direct the reader's attention 
to a new focus. After this, the author resumes his use of AT. As with (37), the multiple 
occurrences of ia create an impression of parallel structures, and together with the zero 
in the final clause, highlight the continuity of the human agent. 

To summarise this section, the foregoing discussion has shown [ia V] OV clauses can 
be used to introduce a new focus, similar to [di-V-nya] PT clauses discussed in the pre- 
vious section. In both cases, the clause highlights the agentivity of the human topic 
referent. The difference is that, the repetition of the free pronoun ia ‘3sc’ across OV 
and AT clauses gives a "synchronising" effect; that is, it makes both [ia V] OV clauses 
and [ia meN-V] AT clauses appear structurally similar. In contrast, the occurrence of 


14 Agentivity is understood here in the sense of Cruse (1973: 21) as referring to “an action performed by an 
object which is regarded as using its own energy in carrying out the action", where "object" includes "living 
things, certain types of machine, and natural agents". 
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a [di-V-nya] PT clause among [meN-V] AT clauses marks the event encoded by it as 
structurally different. One could argue, then, that [di-V-nya] is more marked than [ia V]. 
Further research could support this argument with quantitative evidence. 


7 Conclusion 


In this paper I have discussed different clause structures - [meN-V] AT clauses, PT 
clauses with [di-V-nya] and [di-V], and [ia V] OV clauses (considered here as a type 
of PT) - to show that in Indonesian, eventiveness can be encoded by means of any of 
these structures. Thus the preference for AT clauses in Indonesian cannot be understood 
narrowly as a preference for eventive clauses, or that eventiveness is coded only by AT 
clauses. The foregoing analysis shows that each clause type has similar as well as differ- 
ent functions, summarised as follows. First, [meN-V] clauses used to denote successive 
events, either involving the same agent or different agents in transitive clauses and actor 
or undergoer in intransitive clauses, and to mark the climactic portion of a story episode, 
illustrate most dramatically the preference in Indonesian for agentive clauses in SVO or- 
der. Kaswanti Purwo (1988: 226) points out that meN- verbs tend to occur in subordinate, 
backgrounding clauses. Meanwhile, Cumming (1991: 203) found that the “basic” clause 
type in Indonesian novels is AT with T>V order; PT clauses with V>T order, though 
still used to encode eventiveness, are marked. As we saw, [meN-V] occurs in both even- 
tive/foregrounding and backgrounding clauses. Precisely whether and how [meN-V] AT 
clauses are used to encode eventiveness in Indonesian is a question that needs to be ex- 
plored further. For now, it will have to be sufficient to say that these clauses can be used 
to encode both. 

Second, PT clauses with [di-V-nya] remain an important resource for encoding even- 
tiveness in Indonesian but their function is now mainly to signal a new focus, draw- 
ing the addressee/reader's attention to a particular event (or series of events) that con- 
trasts with the event or state that comes before. In 84 I showed that [di-V] clauses occur 
with pre-predicate patient, indicating the patient's "topical" status. It may be that these 
clauses are mainly used to highlight patient continuity, while [di-V-nya] is primarily 
used to emphasise a particular event the agent is performing. Further research would be 
able to confirm this early observation. 

Third, OV clauses with [ia V] structure are also used to encode eventiveness. Its co- 
occurrence with [ia meN-V] AT clauses emphasises the continuity of the third person 
agent and creates an impression of structural symmetry. The repetition of ia ‘3sc’ across 
two different clause types produces a rhythmic effect much desired in fictional texts. 
To what extent rhythm and structural symmetry are also important considerations in 
other types of texts (e.g., news reporting) is an interesting topic that can be explored in a 
larger study. Such a study would also be able to inform us whether, with the weakening 
in pragmatic force of [di-V-nya], [ia V] is now preferred for marking focal events. 

The majority of the data on which this study is based are taken from fictional nar- 
rative. Whereas Shiohara (2015) tested Cumming's findings using experimental data, I 
have drawn on fictional narrative to approximate the kind of data that Cumming used 
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in drawing her generalisations. The results presented here might invite questions re- 
garding genre; that is, the arguments I have raised might be objected to as valid only 
for narrative discourse. However, I hope to have demonstrated that they are also rele- 
vant for television news discourse. Finally, stylistic preferences can vary greatly between 
language users, as Cumming (1991: 174-175) herself has also noted. By including news 
discourse and fictional texts by different authors I hope to have shown that the choice 
of constituent order is not just a matter of individual style. 
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Abbreviations 

1 first person LOC locative 

2 first person NEG negation 

3 third person PERF perfective 
APPL applicative RDP reduplication 
DEF definite article REL relative clause 
DIR directional SG singular 
INDEF indefinite 
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Chapter 7 


Austronesian predication and the 
emergence of biclausal clefts in 
Indonesian languages 


Daniel Kaufman 
Queens College & ELA 


Information structure is tied up closely with predication in predicate-initial Philippine-type 
languages. In these languages, subjects are presupposed and the predicate position operates 
as a default focus position. The present paper argues that there is no need for biclausal focus 
constructions in these languages due to the nominal properties of their event-denoting pred- 
icates. Non-Philippine-type Austronesian languages develop a stronger noun/verb distinc- 
tion that I argue ultimately gives rise to biclausal focus constructions. The building blocks of 
biclausal clefts in Indonesian languages are analyzed as well as the nature of predication in 
Philippine-type languages. Finally, I discuss a paradox in the syntax of definite predicates in 
Philippine-type languages. In a canonical predication, the less referential portion (the pred- 
icate) precedes the more referential portion (the subject). However, when both portions are 
definite the relation is reversed such that the more referential portion must be initial. I tie 
this to animacy effects found in other Austronesian languages in which a referent higher 
on the animacy scale must linearly precede one that is lower. 


1 Introduction 


Languages vary widely in their strategies for indicating pragmatic relations such as Fo- 
cus and topic. In the simplest case, a language may employ dedicated morphological 
markers which combine directly with focused or topical constituents. More common 
perhaps is the use of dedicated syntactic positions, typically on the left periphery of the 
clause, which host focused or topicalized constituents. Finally, all languages are thought 
to express basic information structure via prosodic means, although the actual imple- 
mentation differs significantly from language to language. Parallel to pragmatic rela- 
tions such as TOPIC and Focus, all languages possess a basic SUBJECT-PREDICATE relation 
which is partly independent of pragmatics but which also intersects with the phenomena 
of topic, focus and presupposition. While there has been notable success in the defini- 
tion and analysis of pragmatic relations over the last several decades, the true nature 
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of the subject-predicate relation remains one of the most fraught topics in the history 
of linguistics, with its beginnings in the work of Aristotle and Plato. Indeed, as David- 
son (2005: 83) states with regard to predication, “It is a mark of Plato's extraordinary 
philosophical power that he introduced a problem that remained unresolved for more 
than two millennia.” As could be surmised solely from the disagreement among syntactic 
analyses of English, a robust cross-linguistic definition of predicate and subject remains 
even more elusive. Copular clauses and cleft structures are of special interest here as 
notional predicates can occupy the canonical syntactic position of the subject (and vice 
versa) in these sentence types. In the present work, I am concerned with the interplay 
between the subject-predicate relation and information structure in Austronesian lan- 
guages. Specifically, I would like to answer the following three questions: 


(i) What is the evidence for biclausal cleft structures in Philippine-type! versus In- 
donesian languages? (83) 


(ii) How and why do biclausal cleft structures come into being outside of Philippine- 
type languages? (84) 


(iii) What does it mean to be a predicate in Philippine-type languages? (85) 


We can briefly preview the answers put forth below. With regard to (i), I argue that a gen- 
uine cleft structure in Indonesian languages emerges from a more symmetric Philippine- 
type system where true biclausal clefts do not exist. In regard to (ii), I show how a distinc- 
tion between plain modification and modification by relative clause develops in Indone- 
sian languages and how old functional morphology is recruited to mark relative clauses. 
Finally, regarding (iii), I argue that the predicate-subject relation in Philippine-type lan- 
guages is determined by the relative referentiality of the two basic parts of a proposition 
similar to copular clauses in more familiar languages. The more referential half of the 
predication (i.e. the subject) follows the less referential half (i.e. the predicate). An inter- 
esting complication is that when both the predicate and the subject are referential, the 
part of the predication higher on the referentiality/animateness scale precedes the one 
lower on the scale. That is, the principle which derives the normal predicate-initial order 
in Philippine-type languages appears to be reversed in these cases. 

Languages that sit on the border of the Philippine-type and non-Philippine-type are 
especially interesting in regard to information structure. In $4, I examine Balantak as a 
language that appears to be transitioning from monoclausal to biclausal focus construc- 
tions. This in turn sheds light on the development of biclausal constructions in languages 
that have diverged even further from the Philippine-type, such as Malay/Indonesian. 

In 82, I attempt to define all the relevant categories in terms that are as theory-neutral 
as possible. The relevant notions for our purposes are subject and predicate (82.1), prag- 


l*Philippine-type languages" refer to a typological grouping rather than a geographic or phylogenetic one. 
It is used here to refer to Austronesian languages with a historically conservative set of (3 or 4) voices 
(following Blust 2002). Crucially, in Philippine-type languages, these voices are symmetrical in that a pred- 
icate can only bear one voice at a time and the agent argument is not demoted in the non-actor voices, as 
it would be in a canonical passive. 
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matic relations such as topic, focus and presupposition (S2.2), and the various types of 
clefts together with their component parts (82.3). 


2 Defining the terms 


2.1 Subject and predicate 


Several streams in philosophy of language, semantics and even formal syntax have taken 
a purely taxonomic approach to the notion of predicate with the goal of seeking a unify- 
ing trait in these types. The philosophical literature, in particular, is replete with claims 
such as “predicates ascribe” and “predicates designate”. One of Frege's most important 
contributions to our current understanding of predicate involved viewing it as an ele- 
ment with unsaturated arguments; in his words, “not all parts of a thought can be com- 
plete; at least one must be unsaturated or predicative; otherwise they would not hold 
together” (Frege 1892/1997: 193).2 For Frege, linguistic elements such as names and defi- 
nite descriptions could not be classified as predicates as they cannot be naturally thought 
of as having unsaturated arguments in the way that “runs” has a single unsaturated ar- 
gument and “punches” has two unsaturated arguments. But the fact that languages rou- 
tinely place definite descriptions, names and even pronouns in the predicate position of 
clauses that bear all the morphosyntactic hallmarks of canonical predication poses an 
immediate empirical challenge to Frege's view of predicates as a natural class of linguistic 
elements.? Under the direct or indirect influence of Frege, a large body of work in syntax 
has treated such sentences as something other than pure predications. This has led, for 
instance, to a taxonomic tradition in the study of copular sentences (Mikkelsen 2011), in 
which copular clauses can come in specificational, equational and identificational flavors 
which largely correlate to the referentiality of the "predicate". 

For present purposes, the notions SUBJECT and PREDICATE can be defined following 
type-theoretic predicate logic. Namely, the subject and predicate are the two constituents 
that combine to yield a TRUTH VALUE. It is not always a trivial matter to determine what 
types of strings have a truth value and which do not. In the simple case, we can compare 
the modification relation in (1) with the predication in (2). 


(1) Tagalog 
ang matangkad na dalaga 
NOM tall LNK girl 


‘the tall girl’ 


? As a reviewer notes, an important aspect of Frege’s contribution was to assimilate all types of predication to 
verbal predication. In Fregean semantics, predicates can be defined simply as categories with unsaturated 
terms. 

3As noted by Modrak (1985), among others, philosophers have chiefly attempted to explain metaphysical 
predication, which only bears an incidental relation to linguistic predication. Patterns of linguistic predi- 
cation across human language have thus not played a major role in philosophical investigations. 
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(2) Tagalog 
Matangkad ong dalaga. 
tall NOM girl 


"Ihe girl is tall? 


Tagalog speakers understand (1) as having a potential reference in the world but not a 
truth value, whereas the opposite intuition holds for (2) (which nonetheless contains the 
referring expression ang dalaga ‘the girl’). In (1), the entire string consists of a single 
Determiner Phrase (DP) marked with the nominative case determiner ang. In (2), the 
string contains two major phrases, a predicate, followed by the nominative marked DP. 

Himmelmann (1986) takes the key feature of predications to be "challengeability": ^A 
predicative structure always allows for — or even demands - a yes-no reaction" (Himmel- 
mann 1986: 26). This view, correctly, I believe, draws a strong line between predication 
on the one hand and modification, secondary predication and even subordinate predica- 
tion on the other hand, a distinction which not all theories abide by. I also agree here 
with Himmelmann in understanding predicates to be crucially a relational concept rather 
than an inherent property of certain types of linguistic elements.* 

A particularly vexed question in Philippine linguistics regards which of the two argu- 
ments of a basic transitive clause should be considered the "subject", with all possible 
answers having been posited by different analysts (including “none of the above", see 
Schachter 1976). In (3), we see three variations on a patient voice clause and in (4) we 
see the same kind of variations in an actor voice clause.? Following a type-theoretic ap- 
proach, we can see that there is an important difference between the (b) and (c) sentences. 
In an out-of-the-blue setting, (3b) and (4c) are judged to have truth values but (3c) and 
(4b) are not. The latter two sentences are not ungrammatical, but they must depend on 
the preceding discourse to obtain a truth value. That is, as long as anyone ate the tofu, 
(3b) will be judged true but (3c) cannot be judged as true or false even if we know that 
Juan ate something. Similarly, for just anyone to have eaten tofu does not make the actor 
voice sentence in (4b) true. In order for it to be evaluated as true or false, the preceding 
discourse has to provide the reference for the elided nominative argument. 


^| prefer though to rely on the potential for a truth value rather than the notion of challengeability as the 
latter cannot cleanly apply to imperatives and interrogatives. While some views on questions and imper- 
atives take them to lack truth values, questions and imperatives seem to me best understood as other 
(non-assertive) things we do with truth values. Declaratives assert that a proposition is true or false; yes- 
no questions request the hearer to posit a true or false judgement on the proposition; content questions 
request a value to make a proposition with a variable true; imperatives demand that the addressee make 
the proposition true. None of these acts could be executed if the proposition itself had no truth value. Thus, 
just as Dog is not a predication, neither is Dog? a possible yes-no question, nor Dog! a command. I believe 
these facts can be unified in any approach that sees speech acts as operations on truth values rather than 
restricting truth values to assertions. 

"ln the non-actor voices, the agent is expressed in the genitive case, treated by some as ergative case, while 
the argument selected by the voice morphology is expressed in the nominative/absolutive case. 
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(3) TAGALOG 


a. K<in>ain-@ ni Juan ang tokwa. 
<BEG>eat-PV GEN Juan NOM tofu 


“Juan ate the tofu: 


b. K<in>áin-@ ang tokwa. 
<BEG>eat-PV NOM tofu 


"[he tofu was eaten: 


c. ZK«in-áin-O ni Juan. 
<BEG>eat-PV GEN Juan 


‘Juan ate (it)? 


(4) TAGALOG 


a. K<um>dain ng tokwasi Juan. 
<AV.BEG>eat GEN tofu NOM Juan 


‘Juan ate tofu.’ 


b. %K<um>din ng tokwa. 
<AV.BEG>eat GEN tofu 


*(S/he) ate tofu? 


c. K«um-áin si Juan. 
<AV.BEG>eat NOM Juan 


“Juan ate: 


The key generalization then is that a predicate must combine with a nominative/abso- 
lutive (in Tagalog, ang marked) argument to obtain a truth value. On this basis, we can 
refer to the ang phrase as the subject and the clause-initial phrases in the above exam- 
ples as the predicate. Precisely parallel facts have been observed for several Polynesian 
languages, such as East Futunan and Tongan (Dukes 1998; Tchekhoff 1981; Biggs 1974). 
Dukes (1998) sums up the Tongan situation in a way that describes Philippine-type lan- 
guages equally well: 


An omitted ergative argument need not presuppose any particular referent in the discourse 
and may in fact be interpreted existentially in much the same way an omitted agent in an 
English passive is interpreted. When an absolutive is omitted however, it must be inter- 
preted referentially as a null pronoun picking out some previously mentioned individual. 
As Biggs puts it, native speakers of these languages consider sentences which are missing 
an absolutive to be ‘incomplete’, whereas sentences missing ergatives are not. 


Note that this definition of subject is completely independent from the "subject" as iden- 
tified by classic syntactic diagnostics like those posited by Keenan (1976), e.g. binding 
relations, raising and control, many of which converge on the more agentive argument 
of a transitive clause, as in English, and only some of which seem to pick out the ang 
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phrase.Ó The concept of the subject-predicate relation as posited above is inherently sym- 
metrical; the subject and predicate are simply the two (topmost) constituents which are 
combined to yield a truth value. However, few if any human languages treat these con- 
stituents symmetrically. There are clearly distinct positions for subject and predicate in 
the vast majority of described languages in the world." Certain types of copular clauses, 
however, are apparently reversible in many languages but with subtly different entail- 
ments. Jespersen (1937/1984; 1965) notes the distinct meanings of ‘my brother’ in the 
following passage: 


Now, take the two sentences: 


My brother was captain of the vessel, and 
The captain of the vessel was my brother. 


In the former the words my brother are more definite (my only brother, or the 
brother whom we were just talking about) than in the second (one of my brothers, 
or leaving the question open whether I have more than one). (Jespersen 1965: :153) 


Based on a family of similar observations, Jespersen develops the idea that choice of 
subject and predicate is based on relative familiarity. We can therefore conceptualize 
predication as an inherently symmetrical relation but one whose syntactic expression 
is highly sensitive to referentiality. That is, the more referential of the two elements in 
a predication relation will be mapped to a position which we can, following tradition, 
refer to as “subject” and the less referential of the two will be mapped to a position we 
call the “predicate”. In English, a subject initial language, there is only one way to make 
a predication between Mary and a linguist, that is, by treating Mary as the subject and 
mapping a linguist to the predicate, as in (5). We say that Philippine-type languages are 
"predicate-initial" because the less referential component of the predication relation is 
clause-initial while the more referential component follows it, as shown in (6). Just as in 


The ang phrase relation of Tagalog which I refer to here as subject has, in fact, been given so many names 
over the years to distinguish it from the subject of a hierarchical argument structure that it is hard to 
keep track. Among others, we find, “predicate base", “pivot”, “topic”, and the neutral ang phrase. (See Blust 
2002, Ross 2006, Kroeger 2007 and Blust 2015 for good summaries of the terminological and conceptual 
challenges presented by Austronesian voice and case.) I maintain the term subject here because of the fa- 
miliarity of the subject-predicate relation, which is specifically relevant here. Moreover, the hierarchical 
relations between co-arguments of a predicate (e.g. the relation between subjects and objects) is not rele- 
vant for our purposes and so we can avoid the usual confusion. These are, however, very different relations 
that should be kept separate terminologically and theoretically, as for instance in Foley & Van Valin (1984). 

Diverging from most generative syntacticians, den Dikken (2006) does argue for a symmetrical view of 
predication but applies the term very broadly to include phenomena that are generally analyzed as mod- 
ification. Heycock’s (2013) overview of predication in generative syntax shows how far this line of work 
has diverged from the traditional Aristotelian concept of a predicate as part of a bipartite proposition that 
yields a truth value. 

8This is also very much in line with the view of predication developed by Williams (1997: 331), who treats 
referential NPs as potential predicates: “It is sometimes thought that a predicate cannot be ‘referential’. It 
seems to me though that the best we can say is that the predicate is less “referential” than its subject, and 
that what we really mean is something having to do with directness of acquaintance.” Concomitantly, for 
Williams (1997: 323), “A phrase is not predicational in any absolute sense, but only in relation to its subject.” 
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English, the relative referentiality of the two parts of the predication determines their 
position in the clause. Reversing the order, as in (6b), results in ungrammaticality. 


(5 English 
a. Mary is a linguist. 


b. *A linguist is Mary. 


(6) Tagalog 
a. Abogado si ` jojo. 
lawyer NOM Jojo 
‘Jojo is a lawyer: 
b. *Si Jojo abogado. 
NOM Jojo lawyer 


I would thus like to take a Jespersonian approach here, which does not rely on purported 
universal properties of subjects (e.g. referring") or predicates (e.g. “unsaturated”). Hav- 
ing established this still informal, but hopefully workable, concept of predication, we 
turn to the impressive flexibility of Philippine languages in mapping lexical categories 
to the predicate and subject positions, as exemplified in (7). This was noted by Bloom- 
field (1917) for Tagalog and discussed extensively in the subsequent literature (Gil 1993; 
Himmelmann 1987; 1991; Foley 2008; Schachter & Otanes 1982; Kaufman 20094). 


(7) Tagalog 
a. K«um-a-káin ang laláki. 
<AV>IMPRF~eat NOM man 
"Ihe man is eating: 
b. Laláki ang k«um-a-káin. 
man NOM <AV>IMPRF~eat 


"Ihe eating one is a man? 


In a very simplistic schema, we can conceive of English and Tagalog differing as in (8) and 
(9). Whereas the English clausal spine makes crucial reference to lexical categories such 
as NP and VP, Philippine-type phrase structure seems to refer primarily to the functional 
categories PredP and SubjP which can in turn host phrases of any lexical category (XP 
and YP below)? 


?Proponents of this view, in one form or another, include Scheerer (1924); Lopez (1937/1977); Capell (1964); 
Starosta et al. (1982); Egerod (1988); DeWolf (1988); Himmelmann (1991); Lemaréchal (1991); Naylor (1975; 
1980; 1995); Kaufman (20092). Byma (1986), on the other hand, and most subsequent syntacticians (Richards 
2000; Rackowski 2002; Aldridge 2004), have defended (or assumed) an analysis in which Tagalog is also 
structured like (8). Richards (2009a,b) explicitly argues that all predications in Tagalog are mediated by a 
verbal element but that this element is null in most copular clauses. For reasons of space, I refer the reader 
to Kaufman (forthcoming) for a rebuttal of this view. 
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(8) ENGLISH-TYPE PHRASE STRUCTURE 


NP VP 
(Subj) (Pred) 


(9) PHILIPPINE-TYPE PHRASE STRUCTURE 


S 


Z s 


PredP  SubjP 


NS 


XP NOM YP 


In order to understand how phrases are mapped to the predicate and subject position in 
Philippine-type languages we must first define the crucial pragmatic concepts that come 
into play. We turn to this in the following subsection. 


2.2 Pragmatic relations 


Presupposition and focus are often described in shorthand as old information and new 
information, respectively. While this evokes the right idea, Lambrecht (1994) argues 
against such oversimplified definitions. I follow Lambrecht's definitions for the relevant 
categories, as given below Jl 


(10) Presupposition, assertion, focus and topic (Lambrecht 1994: 52, 213, 131) 


a. PRAGMATIC PRESUPPOSITION: The set of propositions lexico-grammatically 
evoked in a sentence which the speaker assumes the hearer already knows 
or is ready to take for granted at the time the sentence is uttered. 


b. PRAGMATIC ASSERTION: The proposition expressed by a sentence which the 
hearer is expected to know or take for granted as a result of hearing the sentence 
uttered. 


c. FOCUS: The semantic component of a pragmatically structured proposition 
whereby the assertion differs from the presupposition. 


d. TOPIC: A referent is interpreted as the topic of a proposition if in a given situa- 
tion the proposition is construed as being about this referent, i.e. as expressing 
information which is relevant to and which increases the addressee's knowl- 
edge of this referent. 


10 Abbott (2000) offers an alternative view of presuppositions as non-assertions rather than information 
known to the hearer. This may well fit the Austronesian data better but I leave this question to further work. 
A good overview of the issues and literature surrounding presuppositions is found in Kadmon (2000). 
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While the Tagalog ang phrase is often referred to as “topic” in different analytic tradi- 
tions, it has been shown clearly by Naylor (1975), Kroeger (1993) and Kaufman (2005) to 
have no inherent pragmatic status beyond its definiteness or referentiality. Tagalog and, 
it would seem, all other Philippine languages have a bona fide topic position on the left 
periphery. In Tagalog, the fronted pragmatic topic, is followed either by the topic marker 
ayorapause. In Tagalog, but not all Philippine languages, there is also a dedicated focus 
position on the left periphery of the clause which hosts oblique phrases, exemplified in 


(11). 


(11) Tagalog 
[Sa Manila]poc=ka=ba p<um>unta? 
OBL Manila=2s.NOM=QM <AV.PRF>g0 


‘Did you go to MANILA?’ (‘Was it to Manila that you went?’) (Kaufman 2005: 182) 


The presence of an oblique phrase in the focus position in the left periphery bifurcates 
the sentence into a focus and a presupposition. In the above, 'to Manila' is the focus 
and it is presupposed that the addressee had gone somewhere. The question would be 
inappropriate if this information was not already part of the discourse in the same way 
that "Was it to Manila that you went?’ would be inappropriate in an out-of-the-blue 
context. On the other hand, a phrase in the left peripheral topic position followed either 
by the topic marker or a pause, needs to either be in the discourse already or contrasted 
with a similar argument that belongs to the same set. In (12), the fronted oblique phrase 
can serve as a contrastive topic, pragmatically akin to English, ^What about Naga, have 
you gone there?'. Note that the topic is further outside the clause and thus does not host 
second position clitics. Furthermore, it does not trigger a presupposition. The question in 
(12) is still felicitous without it being in the discourse that the addressee went somewhere. 


(12) Tagalog 
[Sa Naga], p<um>unta=ka=ba? 
OBL Naga <AV.PRF>g0=2S.NOM=QM 


‘To Naga, did you go (there) already?’ 


With this brief introduction we are now prepared to turn to the mapping of these prag- 
matic categories on to phrase structure across several Austronesian languages. 


2.3 Clefts 


Cleft structures make use of the subject-predicate relation to satisfy requirements of 
information structure. Subjects are canonically (but not necessarily) topic-like and pred- 
icates canonically (but not necessarily) align with the focused constituent of a clause. 
Thus, focused subjects and presupposed predicates constitute non-canonical alignments 
which languages may either tolerate or avoid by means of various syntactic mechanisms. 
Languages with a high tolerance for focused subjects, such as English, will allow such a 
subject to be merely marked by intonation, as in (13a). However, an option also exists to 
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shift such a focused subject into the predicate position, as in the it-cleft sentence in (13b) 
(Lambrecht 1994). 


(13) a. Only [John]goc knows Jane. 
b. It's only [John]roc who knows Jane. 


At the same time, the presupposition of an English it-cleft is demarcated syntactically 
by means of a relative clause. Thus, while both (13a) and (13b) contain a presupposition 
‘X knows Jane’, its pragmatic status is only reflected syntactically in (13b), by means of 
the relative clause who knows Jane. The English it-cleft can thus be said to create a more 
transparent mapping between the syntactic and pragmatic structure of the clause. 

Other languages do not tolerate non-canonical mappings such as that in (13a). For 
instance, the Malay/Indonesian adverb saja 'only', which must combine with a focused 
constituent preceding it, cannot associate with a subject in a simple declarative clause, as 
shown in (14a). Instead, a cleft structure is required in which the presupposed predicate 
is packaged as a relative clause, as shown in (14b). 


(14) Indonesian 


a. Presiden (*saja) bisa menilai kinerja menteri. 
president only can Av:evaluate output minister 
“A president can evaluate a minister's output? 

b. Presiden saja yang bisa menilai kinerja menteri. 
president only RELT can Av:evaluate output minister 


‘Only a president can evaluate a minister's output. 


As seen in (15), no special manipulation is required in order to narrowly focus the pred- 
icate or a part thereof, as this respects the canonical mapping between predicate and 
focus. 


(15) Indonesian 
Presiden bisa menilai kinerja menteri saja. 
president can Av:evaluate output minister only 


‘A president can only evaluate a minister’s output. 


Most interestingly, we find that the Austronesian tendency to express presuppositions 
syntactically manifests itself in Philippine English, as well. Whereas English can employ 
prosodic focus alone in a sentence like (16), Philippine English will invariably employ 
a TH-cleft (to be introduced below) in the same function, as seen in (16) and (17). The 
Tagalog equivalent is given in (18). 


(16) US ENGLISH 
Joun will carry your bag. 


(17) PHILIPPINE ENGLISH 
John will be the one to carry your bag. 
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(18) Tagalog 
Si Juan ang mag-da-dala ng bag mo. 
NOM Juan NOM AV-IMPRF-CaITy GEN bag 2SG.GEN 


‘Juan will carry your bag: (Lit. ‘Juan will be the one to carry your bag’) 


Clefts thus function to transparently bifurcate the sentence into a focus and a presup- 
position. As seen in the difference between English and Indonesian above, languages 
differ as to the extent to which they require such transparency. In terms of the syntactic 
hallmarks of cleft sentences, Lambrecht (2001) offers the following definition: 


(19) CLEFT CONSTRUCTION (Lambrecht 2001: 467) 
A cleft construction is a complex sentence structure consisting of a matrix clause 
headed by a copula and a relative or relative-like clause whose relativized argu- 
ment is coindexed with the predicative argument of the copula. Taken together, 
the matrix and the relative express a logically simple proposition, which can also 
be expressed in the form of a single clause without a change in truth conditions. 


There are two components in (19). The structural component defines clefts as a biclausal 
structure containing a relative clause in a larger copular sentence. The semantic compo- 
nent of the definition relates clefts to simpler monoclausal sentences by virtue of their 
similar meaning. The cleft and the monoclausal structure differ in information structure 
and implicature but not in their basic truth conditions. 

Lambrecht advocates for a taxonomy of English clefts as in (20), where caps indicates 
focus: 


(20) CLEFT TYPES (Lambrecht 2001: 467) 
a. Ilike CHAMPAGNE. Canonical sentence 


b. Itis CHAMPAGNE (that) Ilike. IT cleft 
c. WhatlIlike is CHAMPAGNE. WH cleft (pseudo-cleft) 


d. CHAMPAGNE is what I like. Reverse WH cleft 
(reverse/inverted pseudo-cleft) 


In (20a) we find the canonical monoclausal sentence which is roughly equivalent in its 
truth conditions to the following clefts. The it-cleft places the focused phrase in the 
predicate position of a copular clause in which a dummy pronoun is the subject. The 
presupposition is packaged as a relative clause.!! The types in (20c) and (20d) are termed 
alternatively wh-clefts or pseudo-clefts (although I will use only the latter term in the 
following). Here, there is no dummy subject. A relative clause headed by an interrogative 
pronoun like ‘what’ is in a copular construction with a focused phrase. In English, this 
type is reversible so that the presupposition can be the subject of the matrix clause, as in 
the standard pseudo-cleft exemplified in (20c), or the predicate of the matrix clause, as 


"The precise relation between the two clauses in the English it-cleft has been subject to rather intense 
scrutiny, summed up recently by Reeve (2010). 
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in the inverse pseudo-cleft exemplified in (20d). To Lambrecht's taxonomy, we can add 
the "TH-cleft", in (21), first identified as a separate type by Ball (1977). Here, the relative 
clause modifies a definitely determined semantically bleached noun phrase, e.g. ‘the one’, 
‘the thing’, etc. 


(21) a The one/thing I like is CHAMPAGNE. TH-cleft 
b. CHAMPAGNE is the one/thing I like. | Reverse/inverse TH-cleft 


The syntax of clefts accommodates information structure in several ways. In structures 
like the English it-cleft (It’s JOHN who bit me), part of the focus semantics derives from 
mapping a phrase to the object position of a copular structure, a more hospitable posi- 
tion for focused material than the subject position. In all types of cleft sentences, the 
presupposition is clearly demarcated by packaging it as a relative clause of some type. 

It is clearly not the case that relative clauses always contain a presupposition outside 
of cleft sentences. For instance, the relative clause subject in (22) does not presuppose 
that someone will come after closing time. The sentence in (23), however, does entail such 
a presupposition, and this shows that it is the determiner or demonstrative that gives rise 
to the presupposition rather than anything inherent to the relative clause itself.!? 


(22) Any customer who arrives after closing time will not be served. 
(23) Iwill take care of those customers who arrive after closing time. 


In addition to determiners of a relativized noun, a phrase headed by an interrogative pro- 
noun can be at least partly responsible for projecting a presupposition. A certain chess 
hustler in Greenwich Village used to rile his opponents with the following rhetorical 
question during the heat of a match: 


(24) Do you know what I like about your game? Absolutely nothing! 


The jarring quality of the answer is the effect of canceling the presupposition in the 
question. The infelicity of the following cleft sentences in (25) makes this clear. That the 
presupposition does not come directly from the relative structure can again be seen in 
the felicitous (26), which contains a relative clause headed by the quantifier ‘nothing’, 
and which carries no presupposition. 


(25) a. %Nothing is what I like about your game. 
b. %What I like about your game is nothing. 


(26) ‘There is nothing that I like about your game. 


12See Kroeger (2009) for a similar point with regard to mistaken assumptions about headless relative clauses. 
As Kroeger shows for Tagalog, the presuppositions in such constructions are triggered by the determiners 
rather than the relative structure itself. Note that Kroeger (1993) claims that the subject actually precedes 
the predicate in Tagalog translational equivalents to English cleft sentences. On this view, sino ‘who’ would 
be the subject in a sentence like (27). Kroeger (2009: fn.3), however, recants this position and views (ap- 
parent) headless relatives like ang dumating ‘the one who arrived’ in (27) as being in subject position. 
This change in perspective was prompted by Tagalog’s similar behavior to Malagasy as analyzed by Paul 
(2008) and Potsdam (2009), who offer several pieces of evidence for the predicatehood of (non-adjunct) 
interrogative phrases. 
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We can say then that relative clauses pave the road for presuppositions without neces- 
sarily triggering them directly. As we will see in the following section, what triggers the 
presuppositional reading in putative Philippine-type clefts is the definite semantics of 
the nominative case marking determiner (e.g. Tagalog ang). Unlike English, interroga- 
tive pronouns are never employed for this purpose in Philippine-type languages. 


3 The syntactic structure of Austronesian clefts 


A key point of variation between Philippine-type and non-Philippine-type Austronesian 
languages can be exemplified with the following example form Tagalog (27) and formal 
Indonesian/Malay (28). 


(27) Tagalog 
a. Sino ang d<um>ating? 
who NOM <AV.BEG>arrive 


“Who arrived?’ 


b. D<um>ating ang guro. 
<AV.BEG>arrive NOM teacher 


“The teacher arrived: 


(28) Formal Indonesian/Malay 
a. Siapa yang datang? 
who RELT arrive 
"Who arrived?' 


b. Datang abang-nya... 
arrive elder.brother-3s.GEN 


‘His brother arrived..’ (Hikayat Pahang 128:9) 


Nearly all Philippine-type languages require some form of case marking on clausal argu- 
ments. In (27), this can be seen for Tagalog in the case marking determiner ang, which can 
be glossed nominative or absolutive (see Kaufman 2017 for discussion), but whose func- 
tion is also tightly bound up with a definite/specific reading of the following NP (Him- 
melmann 1997).? There are two crucial things to note in this comparison. First, while 
Philippine-type languages require such case markers, only few Austronesian languages 
of Indonesia employ case marking on arguments (see Himmelmann 2005). The relativizer 
yang is strongly preferred in the subject question in (28a) but would not be acceptable 
in (28b) and can thus be easily distinguished from ang in Tagalog and the equivalents in 
other Philippine languages. Second, the case markers of Philippine languages do not dis- 
criminate between apparent verbal and nominal complements. Constantino (1965) was 


P'The precise semantics of ang has been debated in the literature. A non-definite reading can be obtained in 
Tagalog with ang isang... NOM one:LNK. However, without the presence of the numeral isa ‘one’, felicitous 
use of ang requires familiarity on the part of the hearer. For this reason, I maintain that definiteness, rather 
than specificity, best captures the pragmatic function of ang. 
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Table 1: Philippine sentence patterns following Constantino (1965) 


Tagalog kina:?in nan bata? ay manga 
Bikolano kinakan kan aki? ap manga 
Cebuano ginka:?un han bata? ay manga 
Hiligaynon kina?un san bata an pahu? 
Tausug kya?un sin bata? in mampallam 
Ilokano kinnan dyay ubin ti manga 
Ibanag kinan na abbig ik manga 
Pangasinan kina =y ugaw su manga 
Kapampangan pena=na nin ank in manga 


eat:PV.PRF GEN child NOM mango 
"Ihe child ate the mango: 


Table 2: Philippine sentence patterns following Constantino (1965) 


Tagalog manga an kina:?in nay basta? 
Bikolano manga an kinakan kan aki? 
Cebuano manga an ginka?un han bata? 
Hiligaynon pahu? ar kina?un san ba:ta 
Tausug mampallam in kya?un sin bata? 
Ilokano manga ti kinnan dyay ubin 
Ibanag manga ik kinan na abbin 
Pangasinan manga su kina =y ugaw 
Kapampangan manga in peya=na nin anak 
mango NOM  eatPV.PRF GEN child 


‘It was the mango that the child ate’ 


the first to show that this is a far reaching characteristic of Philippine languages with the 
comparisons in Table 1 and Table 2. In no Philippine language do putative pseudo-clefts 
contain an overt relative marker, a wh-element, a dummy head noun, or any extra sign 
of nominalization. The voice marked words that serve as predicates in Table 1 are simply 
bare complements to the determiner in Table 2. 

It turns out there is good reason for this symmetry. Starosta et al. (1982) first noted 
that the well-known Austronesian “voice” paradigm appeared to involve a reanalysis of 
nominalizations as voice markers, as shown in (29). 1 


"Unlike the rest of the forms in (29), the actor voice morpheme *<um> is not thought to have developed 
from a nominalizer, as it can be reconstructed to the proto-Austronesian verbal paradigm (Ross 2002; 2009; 
2015). 
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(29) Austronesian voice morphology 
*-en PATIENT NOMINALIZER > PATIENT VOICE 
*-an LOCATIVE NOMINALIZER > LOCATIVE VOICE 
*Si- INSTRUMENTAL NOMINALIZER > INSTRUMENTAL VOICE 
*«um» AGENT VOICE/NOMINALIZER 


I argue in Kaufman (20092) that the large number of morphological and syntactic simi- 
larities between nouns and verbs can be attributed to the reanalysis in (29). This receives 
further support from Ross (2009), who shows that Puyuma, a Formosan language, main- 
tains a division of labor where the verbs of Philippine-type main clauses are restricted 
to relative clauses. Another set of verbal morphology, now only used in a subset of 
Philippine-type languages for imperatives and subjunctives, is used to mark main clause 
verbal predicates in Puyuma.P It seems then that the reanalysis of relative clauses as 
main clause predicates in an earlier Austronesian proto-language had the effect of eras- 
ing any significant differences between relative clauses and main clauses in the daugh- 
ter languages. If words formed with the morphology in (29) are nominalizations, it is no 
surprise that they can serve as direct complements of determiners such as seen above 
in Table 2. There is no need to relativize the verb phrase in sentences such as those in 
Table 2 if the verb is already akin to a thematic nominalization. To make this concrete, 
we could compare the patient voice morpheme in (29) to English -ee in employee. English 
allows for the two semantically similar sentence in (30). 


(30) a. George is the one Jane employs. 


b. George is Jane's employee. 


Clearly, direct relativization from a finite clause is far more common and productive in 
English than thematic nominalization. But in Austronesian, thematic nominalization, as 
in (30b), was developed to an unusual degree for the purpose of forming relativizations 
and these then spread to main clauses.!6 A consequence of this, particularly important for 
focus constructions, is that apparent clefts in Philippine-type languages are monoclausal, 
just as English (30b) is monoclausal. The key facts are reviewed below. 


3.1 Apparent Philippine-type clefts: monoclausal or biclausal? 


A reasonable analysis of the English pseudo-cleft is shown in (32), which can be com- 
pared to the canonical monoclausal declarative sentence in (31). 


DForms which take this set of morphology, referred to as the dependent paradigm by Wolff (1973) and the 
non-indicative paradigm by Ross (2002), cannot serve as the complement of a determiner or case marker 
in Philippine languages (Kaufman 2009a: 25). 

óWhile this is unusual, it is not unique. Shibatani (2009) notes typological similarities between the use of 
thematic nominalizations in Austronesian and similar phenomena in Qiang, Yaqui, Turkish and Quechua. 
With regard to terminology, I refer below to these historically nominalized predicates, i.e. “Philippine-type 
verbs", as participles rather than verbs or nouns. The term participle is preferable because these forms have 
an intermediate status between plain nouns and the historical verbs of Austronesian, the latter which have 
been relegated to non-indicative contexts in Philippine languages. 
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(31) 
TP 
P dam 
DP T 
The child 
e s 
: DP 
ate 
Ee 
the mango 
(32) 
TP 
o 
DP T 
x a S 
The mango T VP 
is Fd 
V CP 
P ds 
what; c 
P uim 
C TP 
Yeo, 


the child V fi 
ate 

The English pseudo-cleft is considered biclausal because it contains two separate ex- 
tended projections of a verb phrase, headed by is in the matrix clause and ate in the 
relative clause in (32). Each domain can mark categories like tense, negation and agree- 
ment independently. In contrast, the monoclausal (31) only contains a single domain for 
tense, negation and agreement. There is little reason to believe such a distinction exists in 
Philippine-type languages. The translational equivalents of (31) and (32) in Tagalog both 
appear monoclausal, as suggested by the analysis in (33) and (34). The only difference 
is that the participle (descended historically from a nominalization) is in the predicate 
position in (33) and in the subject position in (34). We can treat both cases, however, as 
copular clauses, indicated by the (null) Cop in both structures. 
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(33) 
TP 
PredP T 
PartP T DPNoM 
qe SNR Cop = 
SCH DPGEN D NP 
inain ang 
eat.PV.PRF Pann NOM Z S 
nar bata manga 
GEN child mango 
(34) 
TP 
PredP T’ 
NP T DPNoM 
e Cop p dcm 
manga D PartP 
mango ang 
NOM me "Ss 
his DPGEN 
inain 
eat.PV.PRF YO 
nan bata 
GEN child 


One of the few arguments that has been adduced in favor of a biclausal structure for sen- 
tences such as (34) is a putatively asymmetric pattern of clitic placement." As discussed 


"The notion that apparent clefts in Philippine languages are biclausal is widespread although often not 
explicitly argued for. Nagaya (2007: 348), for instance, analyzing Tagalog information structure in an RRG 
framework, states “A cleft construction in Tagalog is an intransitive clause where its single core argument 
is a headless relative clause, and its nucleus is a noun phrase coreferential with the gap in the headless 
relative clause.” as illustrated in his (i), where S represents a gap in the relative clause. 


(i) Si Boyet; ang [p<um>atay [S] kay Juan]. 
NOM Boyet Nom <Av>kill OBL Juan 
"Ihe who killed Juan is Boyet: (Nagaya 2007: 348) 


Even in non-derivational frameworks such as RRG, the gap strategy employed commonly for relative 


clauses in Indo-European languages is typically applied to the analysis of Tagalog without consideration 
of alternative analyses. 
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in detail in Kaufman (2010b), Tagalog pronominal clitics are positioned after the first 
prosodic word in their syntactic domain. Aldridge (2004: 320), assuming that such clitics 
strictly take the clause as their domain, presents the data in (35) as an argument for the 
biclausal structure of apparent clefts. If such sentences were monoclausal, it would stand 
to reason that clitics could follow the interrogative directly as in (35b), but such a pattern 
is ungrammatical. 


(35) Tagalog (Aldridge 2004: 319) 


a. Ano ang g<in>a~gawa=mo? 
what NOM <BEG>IMPRF~d0.PV=2S.GEN 


‘What are you doing?’ 


b. *Ano=mo ang g<in>a~gawa? 
what=2S.GEN NOM <BEG>IMPRF~do.PV 


Second position clitics, however, are not only clause-bound; they are also bound within 
the DP, as can be seen in the following comparison with the possessive clitic =ko 18G.GEN. 
With a bare predicate like kaibigan ‘friend’, as in (36a), the possessive clitic attaches to 
the first element in the clause, in this case, negation. When the predicate is a case marked 
DP, as in (36b), the associated genitive clitic cannot take second position in the clause 
and must attach after the first prosodic word within the DP. 


(36) Tagalog 
a. Hindi[-ko]-siya ^ kaibigan[? -ko]. 
NEG=1S.GEN=3S.NOM friend=1S.GEN 
‘He is not a friend of mine? 
b. Hindi['-ko] siya ang kaibigan[-ko]. 
NEG=1S.GEN 3S.NOM NOM friend-is.GEN 


*He is not the friend of mine: 


Similarly, in an event-denoting predication such as (37), second position clitics cannot 
follow the predicate when they originate within a case-marked DP. 


(37) Tagalog 
Na-dapa[*=ko] ang kapatid[=ko]. 
BEG-fall-1s.GEN NOM sibling=1s.GEN 
"My sibling fell” 


Aldridge (2004), citing data similar to (36), essentially comes to the same conclusion. 


But if this generalization is correct, then the earlier clitic argument from (35) for a bi- 
clausal cleft structure is neutralized. Clitics are unable to escape from a DP and thus the 
genitive clitic in (35), representing an agent embedded in a nominative phrase, cannot 
follow the interrogative. 


35 Aldridge (2004: 262): “I assume that DP is a strong phase, not permitting movement from it. However, a 
predicate nominal is not, so the clitic would be able to move? 
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3.2 True biclausal clefts in Austronesian languages 


The nominal properties of “Philippine-type verbs" is largely lost south of the Philip- 
pines (Kaufman 2009b). Consequently, Malay, even in its earliest attested stages, does 
distinguish relative clauses syntactically through the use of yang. As shown earlier in 
(28), Indonesian-type relativizers like yang are functionally distinct from Philippine-type 
case marking determiners. We can further see in (38) and (39) how Indonesian-type rel- 
ativizers are distinguished syntactically from the "linker" or "ligature" found in most 
Philippine-type languages. First, yang is not required to mediate adjectival modifica- 
tion, as seen in (38a). Second, it can head a phrase without a preceding noun, as seen 
in (38b). The Philippine linker/ligature differs on both of these counts. It must mediate 
all instances of modification, as shown in (39a) and cannot surface without a preceding 
phrase.? 


(38) Indonesian 
a. rumah (yang) besar 
house RELT big 
“big house' 


b. (yang) ini 
RELT this 


“this one’ 


(39) Tagalog 
a. bahay (na) malaki 
house LNK big 


“big house' 
b. (^na) ito 
LNK this 
‘this’ 


A relative clause referring to the agent is built on an actor voice VP with the addition 
of the relativizer yang, as in (40a). As can be seen in (40b), the plain VP cannot stand in 
subject position with the same function.?? 

The presence of a dedicated relativizer is one crucial piece of evidence for the biclausal 
nature of the construction. An additional piece of evidence is the optional presence of 
the copular element adalah. 


See Yap (2011) for a further discussion of yang and its expanding functions in the history of Malay. 
20Verb phrases can also stand in subject position, typically with the help of a demonstrative, when function- 
ing as event nominalizations, as in (i). 


(i) ppl[pp[vp[Menilai kinerja mentri] itu] susah]. 


Av:evaluate output minister that difficult 
‘Evaluating the output of ministers is difficult? 
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(40) Indonesian 


a. Yang menilai kinerja menteri adalah Presiden. 
RELT Av:evaluate output minister cop president 


‘(The one) who evaluates the output of a minister is the president?! 


b. *Menilai kinerja menteri adalah Presiden. 
Av:evaluate output minister cop president 


The innovation of a copula in Indonesian languages has yet to be studied systematically. 
The copula adalah was innovated in the attested history of Malay from a presentative use 
of the existential ada in combination with the emphatic lah. Although English-like cleft 
constructions employing both the copula and a relative clause can be found in modern In- 
donesian, there remain restrictions on the use of the copula that are not well understood. 


Specifically, we find that the copula is rejected in questions like (41b), a constructed min- 
imal pair with the attested (41a). 


(41) Indonesian 


a. Dia adalah yang di-tua-kan di antara sesamanya. 
3S COP  RELT PV-old-APPL PREP among colleague 


‘It’s him who is treated as an elder among colleagues. ?? 


b. Siapa (*adalah) yang di-tua-kan? 
who COP  mELT Pv-old-APPL 


"Who is treated as an elder?' 


In line with the historical development of adalah, it is likely that it selects for a focused 
complement or at least avoids a presupposed one. This is supported by the fact that the 
copula is again possible when the interrogative is in-situ, as in (42). 


(42) Indonesian 
Yang di-tua-kan (adalah) siapa? 
RELT Pv-old-APPL COP who 


"[he one treated as an elder is who?' 


The use of a dedicated relativizer and copula in Indonesian (non-adjunct) content ques- 
tions and focus constructions shows that this language has developed bona fide biclausal 
cleft sentences where Philippine-type languages still employ an equational monoclausal 
structure. Unfortunately, the difference between Philippine-type and non-Philippine- 
type Austronesian languages in this regard has not been given much attention by syntac- 
ticians. The default hypothesis has treated languages like Indonesian as simply having 


?http;//nasional.republika.co.id/berita/nasional/politik/16/01/06/00iwu0354-jokowi-yang-menilai-kinerja- 
menteri-adalah-presiden 

22 http://nasional.kompas.com/read/2014/09/19/06431611/Artidjo.Korupsi.Kanker.yang.Gerogoti.Negara 

23'The ungrammaticality of post-interrogative copulas and copula stranding is not addressed by Cole & Her- 
mon (2000). 
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overt markers for what are null functional elements in Philippine-type languages.^* In 
the next section, we explore how the morphological glue of biclausal constructions is 
recruited from existing lexical and functional elements as part of a larger argument that 
such constructions are relatively recent innovations in the history of Austronesian. 


4 How to jerry-rig an Austronesian biclausal cleft 


We can posit a structure such as the one in (43) for an English TH-cleft (introduced earlier 
in 82.3). The focus here lies in the subject position while the presupposition is packaged 
as a DP containing a relative clause. Note that there are multiple elements within the 
presupposition that are special to this construction. 


(43) 
TP 
P dd 
DP T 
w QU 
That T VP 
is P uu 
V DP 


il oe 


CP 
oe A 
: PST 
C TP 


that| Isawt; 


The DP proper contains a determiner and a semantically bleached noun, in this case one. 
The modifying CP contains a complementizer that and, ostensibly, a null operator in 
the position otherwise reserved for interrogative elements. That these layers are distinct 


"Potsdam (2009) enumerates the pseudo-cleft analyses proposed for wh-questions across a diverse set of 
Austronesian (including both Philippine-type and non-Philippine-type) languages: Palauan (Georgopoulos 
1991), Malay (Cole et al. to appear), Indonesian (Cole et al. 2005), Tsou (Chang 2000), Tagalog (Kroeger 1993; 
Richards 1998; Aldridge 2004; 2002), Seediq (Aldridge 2004; 2002), Malagasy (Paul 2001; 2000; Potsdam 
2006a,b), Maori (Bauer 1991; 1993), Niuean (Seiter 1980), Tuvaluan (Besnier 2000), Tongan (Otsuka 2000; 
Custis 2004). Chung (2010) specifically traces the analysis of content questions in Philippine-type languages 
as pseudo-clefts to Seiter (1975). While such analyses appear well supported for many non-Philippine-type 
languages, it does not seem justified to assume the same structure for Philippine-type languages. 
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is seen both in historical stages of English, as exemplified in (44), and in non-standard 
modern English, (45). 


(44) Middle English (Chaucer's Prolog 836, cited in Curme 1912) 
He which that hath the shortest shall beginne. 
he RELT.PRON COMP has the shortest shall begin 


"He who has the shortest shall begin: 


(45) Here I am, in this room, because of an organization whose work that I deeply, 
deeply admire.?? 


In (46), the DP ‘an organization’, is modified by a CP which contains both an interrog- 
ative phase, ‘whose work’, moved to its left periphery and the complementizer ‘that’. 


(46) 
DP 
p ER SN 
p Ñ 
P din 
| NP DPi c 
"m cm 
organization whose work e TP 
that 


I deeply, deeply admire t; 


Given the distinct roles and positions of the determiner, dummy noun, interrogative 
pronoun and complementizer in the above English structures, we can now ask where 
the functionally equivalent morphology of Austronesian languages fits in, if at all. 
Adelaar (1992) argues convincingly that the ya element in yang is cognate with the 
third person singular pronoun ia and that the following velar nasal is cognate with the 
Philippine linker, which we can treat as a type of complementizer.?Ó The pronoun ia can 
furthermore be broken down into a person marking determiner element i (Ross 2006), 
plus a, a nominal head, as argued for by Reid (2002). The parts of the Malay/Indonesian 


P Ellen Page, "Time to Thrive” Conference, 14 February 2014, Human Rights Campaign video, 0:26, posted 
and accessed 15 February 2014. Cited from Beatrice Santorini's doubly filled comp example webpage: http: 
//www.ling.upenn.edu/~beatrice/examples/doublyFilledCompExamples.html. 

26Reid (2002) argues for a similar analysis of Philippine case markers, in which they consist of a nominal head 
plus a linker. Translating Reid’s proposal to the current framework, a case marker like Tagalog ang would 
have an extremely similar structure to Indonesian yang. This opens up a possibility whose implications 
I cannot fully address here, namely, that every Philippine-type DP is akin to a relative clause headed by 
a dummy nominal. There is some evidence to recommend such a view. Philippine-type DPs can contain 
a larger range of syntactic material than might naively be expected from an Indo-European perspective. 
For example, a case marking determiner can have as part of its complement negation and an independent 
tense domain, as shown in (i). 
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relativizer thus fit cleanly into the earlier template motivated by English, as shown in 
(47). 


(47) 


aN 


"H 


Similarly, Káhler (1974) shows that Ngaju Dayak and Old Javanese recruit demonstratives 
to play the role of relativizer, which can be located in the same DP occupied by ia above. 

Other languages make use interrogative elements, which we locate on the left branch 
of CP. In two pioneering investigations of relative clauses in Indonesian languages, 
Gonda (1943) and Kahler (1974) note the frequency with which “anu is used as a rela- 
tivizer, as in (48).77 


(48) Sundanese 
Moalaja deui hajam (a)nu bisa hibar lapas. 
NEG EXT anymore chicken RELT can fly fast 


“In no case are there anymore chickens which can fly fast? (Kähler 1974: 264) 


In Sundanese, the relativizer is anu, a cognate of what Blust & Trussel (2010+) reconstruct 
as PMP *a-nu “thing whose name is unknown, avoided, or cannot be remembered: what?” 


G) Tagalog 
Ku-kun-in ko bukas ang hindi mo k<in>ain-Ø kahapon. 
IMPRF-take-Pv 1S.GEN tomorrow NOM NEG 2S.GEN <BEG>eat-PV yesterday 


“I will take tomorrow what you didn't eat yesterday: 


Evidence against treating all DPs as full clauses in Philippine-type languages includes the impossibility 
of dependent form imperatives in DPs (83 above) as well as the marked nature of topicalization within 
DPs. The latter argument, however, is weakened by the fact that relative clauses can also plausibly exclude 
a position for fronted topics. If a relative clause analysis is justified for Philippine-type DPs, then the ty- 
pological division between languages like Tagalog and Indonesian would have to be characterized not as 
Philippine-type languages lacking relative clauses but rather lacking bare noun phrase arguments. Histor- 
ically speaking, bare noun phrase arguments and dedicated relative clause markers clearly appear to be 
innovations in languages south of the Philippine area. 

27Kahler (1974) further notes that it appears impossible to reconstruct a dedicated relativizer with any real 
time depth. I attribute this here to the fact that such elements are not necessary in Philippine-type lan- 
guages whose event-denoting predicates are already noun-like and can thus serve as direct complements 
of determiners. 
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Sangirese apa(n), on the other hand, is cognate with Blust & Trussel's reconstruction 
of PMP “apa ‘what?’ and shows evidence for a following nasal linker. The Sangirese 
relativizer thus fits into our schema as shown in (50). 
(49) Sundanese 
I sire apan mam-pangasi’ su sangkamisa naliue, niuntung bue. 
PM 3P what:LNK Av-plantrice in one-week past pETlucky EMPH 
‘They who planted rice last week, are lucky’ (Kahler 1974: 269 citing Adriani 
1893-1894) 


(50) 


Yet other languages make use of a bleached noun alone. This strategy is extremely com- 
mon in Sulawesi where we find various derivations of PMP “tau ‘person’, most often 
in the reduced form to, as in Kulawi (51). The presupposed portion of the clause can be 
analyzed simply as (52), where all the functional positions are left empty except for the 
bleached noun. 


(51) Kulawi (Adriani & Esser 1939: 30) 
Ba bangkele to ` na-mate? 
OM woman RELT PRF-die 


"Was it a woman who died?’ 


(52) 


DP 
= Se 
DP CP 


SS 


D NP C 

E S 
C TP 
zi. 
namate 
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Balantak, a language of the Saluan-Banggai subgroup spoken in the eastern side of Cen- 
tral Sulawesi and recently described by van den Berg & Busenitz (2012), displays a fasci- 
nating combination of features that put it squarely between Philippine and Indonesian 
typologies. Like Philippine-type languages, it has a largely intact voice system and the 
remnants of a case marking system for NPs. The case marker a indicates the subject (i.e. 
the patient of patient voice, agent of agent voice, etc.) when it is post-verbal, as shown 
in (53). Just as in Tagalog and other Philippine languages, this marker also functions as 
a definite determiner. 


(53) Balantak (van den Berg & Busenitz 2012: 47-48) 


a. Ma-polos tuu a  sengke-ku. 
INTR.I-hurt very ART back-1s 


‘My back really hurts? 
b. Boit-i-on a  piso-muu kabai sobii? 
sharpen-APP-PV.1 ART knife-2» or  letit.be 


‘Should your knife be sharpened or shall we just leave it?’ 


As in Philippine-type languages, the case marker still allows for complements of all lex- 
ical categories, as seen in (54). van den Berg & Busenitz (2012) term such constructions 
"semi-clefts". 


(54) ..raaya'aa | mam-bayar. 
3P ART AV.I-pay 


“they were the ones who paid. (van den Berg & Busenitz 2012: 50) 


Remarkably, Balantak has also developed a relative marker men from the bleached noun 
mian ‘person’ (adding further support to the etymology to RELT < “tau ‘person’ in other 
languages of Sulawesi). This is seen in (55), where both the case marking determiner a 
and the relativizer men co-occur in the presupposition of question. 


(55) Ai ime a men mae? 
EMPH.ART who ART RELT go 


‘Who is going?’ (van den Berg & Busenitz 2012: 50) 


The functional structure of clefts in Balantak, shown in (56), would thus look not very 
different from Malay/Indonesian. 
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(56) 


What is unique in Balantak is that the determiner in this structure maintains a robust NP 
case-marking function and can attach directly to verbs in many contexts. Balantak thus 
offers us a live view of what must have happened throughout Indonesia. The loss of case 
marking proceeds hand-in-hand with the rise of relativizers. In Balantak questions, it is 
still the case marker which is obligatory, not the relativizer, as van den Berg & Busenitz 
(2012) show explicitly in (57). 
(57) Balantak 
a. Pi-pii takalana (men) ala-on-muu? 
RED-how.manyliter ^ ART RELT take-pv.1-2P 
'How many liters will you take?' 
b. *Pi-pii takalan (men) ala-on-muu? 
RED-how.manyliter ^ RELT take-pv.1-2P 


'How many liters will you take?' 


But the functional scope of the case marker has also clearly shrunk in comparison to 
typical Philippine-type languages. Specifically, the nominative determiner a only occurs 
post-verbally in Balantak whereas in Philippine languages we find no such restriction. 
The loss of this domain would have given rise to the need for a relativizer men in positions 
where a was no longer licensed.?? 


5 Referentiality and predication 


An idea was put forth earlier that the less referential half of a predication is assigned 
to the predicate position of a clause while the more referential half is packaged as the 


28Like Balantak, Malagasy also instantiates an intermediate position between canonical Philippine-type lan- 
guages and Indonesian. It is more complex than the languages considered here in possessing a distinct (i) 
focus complementizer no, (ii) relativizer izay and (iii) NP marker ny. Keenan's (2008) analysis of Malagasy 
is close to the one advocated here for Philippine-type languages although Paul (2001); Law (2007); Kalin 
(2009); Pearson (2009); Potsdam (2006b) show that the syntax of Malagasy cleft constructions is clearly 
non-equational. Potsdam's (2006b: 220-225) examination of the Malagasy CP in clefts is especially rele- 
vant here but space considerations preclude a full comparison. 
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subject, in line with work on copular clauses in English and other well studied languages. 
We have also seen in the above how the predicate position in Austronesian languages 
functions as a kind of de facto focus position by virtue of Austronesian languages tending 
to package presuppositions as subjects. The mechanics of this turn out to contain some 
surprises. 

First, note that a bare predicate phrase in Tagalog, whether it is headed by an entity- 
denoting, property-denoting or event-denoting word, must precede the subject, as ex- 
emplified in (58) and (59).2? The basic word order in Tagalog and the vast majority of 
Philippine-type languages is thus regularly described as predicate-initial on this basis. 


(58) a. Guro ako. 
teacher 1S.NOM 


‘I am a teacher’ 


b. "Abo guro. Cf. *A teacher is me. 
1S.NOM teacher 


(59) a. Matangkad ako. 
tall 1S.NOM 


Tam tall” 


b. *Ako matangkad. Cf. *Tall is me. 
1S.NOM tall 


A paradox surfaces, however, when both parts of the predication are referential or def- 
inite. In such cases, it appears that the more referential portion of the predication must 
be located in the clause-initial predicate position. In a neutral context, that fills the sub- 
ject position in the English translation of (60). In Tagalog, the demonstrative must be 
positioned in the clause-initial predicate position. In an English copular clause with a 
pronominal argument and a definite NP, the pronominal argument will be selected as 
the subject. In Tagalog, the pronominal argument must always be in predicate position 
when the other argument is definite, as seen in (61).?? 


(60) a. Iyan ang problema. 
that.NOM NOM problem 


‘That’s the problem. (Lit. “The problem is that.) 


??Topicalization is possible to achieve the subject initial orders here but it is marked either by the topic 
marker ay or a very clear intonational break after the subject. In short sentences like (62), the intonational 
break may be more difficult to hear. Speakers seem to agree however that for the order in (b) to be licit, 
there must be distinct phonological phrases while this is not true for the (a) sentences. It is in fact possible 
to make the judgments completely unambiguous through the use of clitics. Specifically, we can compare 
sentences like the following where the second position pronoun has two forms, a long form, ikaw, used in 
predicate position, and a clitic form -ka, used for arguments. When the second person is in a predication 
with a demonstrative, the clitic form is ungrammatical: Ikaw iyan 2s.NoM that.NoM "Ihat's you' versus 
"Iyan-ka ‘that.NoM=2sG.Nom’. When the demonstrative is topicalized, the second person pronoun retains 
its predicate form, Iyan, ikaw that.NoM 2s6.NoM “That, is you’. 

30 As noted in fn.13, Kroeger (1993: 148-149) analyzes such constructions as inversions where the first con- 
stituent is the subject and the latter constituent is the predicate. All evidence, however, points to the initial 
constituents in such sentences behaving as predicates, leading Kroeger (2009) to revise his original analysis. 
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b. *Ang problema iyan. 
NOM problem that.NoM 


(61) a. Akoang guro. 
1$ NOM teacher 


‘Tam the teacher. 


b. *Ang guro ako. Cf. "The teacher is me. 
NOM teacher 1s 


In (62), English and Tagalog again agree in placing the demonstrative in the subject 
position and the first singular pronoun in predicate position. 


(62) a. Ako iyan. 
1S.NOM that.NOM 


“That's me? 


b. ?*Iyan ako. 
that.NOM 15.NOM 


Based on the above data, we can no longer say that Tagalog merely displays the mirror 
image of the English subject-predicate order. While both Austronesian languages and 
English enforce a familiarity condition on subjects (see Mikkelsen 2005: chap.8, for a 
summary of the English facts), there appears to be an additional role for an extended 
definiteness or animacy hierarchy in Tagalog and other Philippine languages. The in- 
volvement of an animacy hierarchy is clear from the following facts. Just like demon- 
stratives, a third person pronoun must be in predicate position if the other half of the 
predication is definitely determined, as seen in (63). But when a third person pronoun 
is in competition with a first person pronoun for predicate position, it is the first person 
which wins, as shown in (64). 


(63) a. Siya ang problema. 
3S.NOM NOM problem 
‘S/he’s the problem. (Lit. “The problem is s/he.) 


b. *Ang problema siya. 
NOM problem 3S.NOM 


(64) a. Ako siya. 
1S.NOM 35.NOM 

'S/he's me? 

b. *Siya ako. 
3S.NOM 1S.NOM 


Although space does not permit a full demonstration of all the possible interactions 
between NP types, the rules follow a slightly modified version of Aissen's (2003: 437) 
definiteness hierarchy, shown in (65). When both halves of a predication are referential, 
the portion higher on the scale in (65) will be selected as predicate. 
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(65) DEFINITENESS/ANIMATENESS HIERARCHY (Silverstein 1976; Aissen 1999; 2003) 
local [1/2] person > third person pronouns > demonstratives/proper name > Defi- 
nite NP > Indefinite Specific NP > Non-Specific 


The only real optionality, as indicated by the lack of ranking above, is found with demon- 
stratives and proper names. When these two types are in a predication relation, either 
order is acceptable, as seen in (66). This can potentially be linked to the ability of proper 
names in Tagalog to behave like pronominal clitics (Billings 2005). 


(66) a. Iyan si Boboy. 
that.NoM NoM Boboy 
"Ihat's Boboy? 
b. Si Boboy iyan. 
NOM Boboy that.NoM 
"Ihat's Boboy? 


In predications where the order is fixed by virtue of the definiteness hierarchy, infor- 
mation structure is flexible. For example, the sentence ako ang guro ‘I am the teacher’ 
can answer both the question in (5) as well as that in (5). This is unusual in Philippine 
languages, as the clause-initial predicate position is otherwise reserved for the focus of 
the sentence rather than the presupposition.?! 


(67) A: Sino ang guro? 
who.NOM NOM teacher 
"Who is the teacher?' 
B: Ako ang guro. 
1S.NOM NOM teacher 


‘I am the teacher: 


3! Aldridge (2013) claims that in predications with two definite DPs (two ang phrases), the first is always the 
focus, exemplified with (i). I am not convinced that a focus reading is necessary or even unmarked on the 
first ang phrase in (i.b). Previous authors have disagreed on the pragmatic status of double ang phrase 
predications in Tagalog. Aldridge argues that predicate fronting in Tagalog (to derive the basic word order) 
is movement to a focus position. My feeling is rather that the focus interpretation of the predicate is a result 
of packaging presuppositions as definitely determined subjects. Once the presupposition is subtracted, the 
left-overs in clause-initial position canonically align with the focus. Examples such as (i) are critical to 
adjudicating between these two analyses but this must be left to further work. 


G) a. [Ang lalaki] ang na-kita ng babae. 
NOM man NOM NVOL.PV-see GEN woman 
"Ihe man is who the woman saw: 


b. [Ang na-kita ng babae] ang lalaki. 
NOM NVOL.PV-see GEN woman NOM man 


"Ihe man is who the woman saw? 
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(68) A: Sino-ka? 
who.NOM-2S.NOM 
"Who are you?' 
B: Ako ang guro. 
1$.NOM NOM teacher 


‘Tam the teacher: 


I would like to offer a potential solution to the paradox of why it is the more definite 
or referential element that becomes the predicate when both elements are referential, 
in stark contrast to the canonical packaging of new information as predicate. The pat- 
tern can be accounted for by viewing it as the product of two potentially conflicting 
constraints. On one hand, presuppositions are packaged as ang phrases and what is left 
in the clause-initial position is the de facto focus. The only principle that predicate se- 
lection in the strict sense takes into account is whether an element is definite or not. If 
one element is definite and the other is not the story ends there; the definite element 
is packaged as subject while the remainder is placed in predicate position. If both ele- 
ments are definite, another principle comes into play which only relates secondarily to 
the subject-predicate relation. This principle demands that elements higher on the defi- 
niteness/animacy hierarchy linearly precede those which are lower on the hierarchy. The 
clause-initial predicate position is then pressed into service to make the more animate 
element precede the less animate one. 

Several pieces of evidence from other Austronesian languages support this analysis. 
First of all, as discussed in Kaufman (2014), many Indonesian languages have indepen- 
dently arrived at a split proclitic/enclitic system for agent marking. In all attested ex- 
amples, third person markers procliticize only if the local persons [1/2] have procliti- 
cized. First person furthermore tends to procliticize before second person. This can be 
seen clearly in the languages of Sumatra, as shown in Table 3 and equally compelling 
evidence is found in the languages of Sulawesi. On one end of the spectrum, all pronom- 
inal agents were enclitic in Old Malay. On the other side of the spectrum, Minangkabau, 
all such agents are expressed as proclitics. In between, Karo Batak, Gayo and Classical 
Malay which show a development that respects the animacy hierarchy such that the 
agents higher on the hierarchy must precede those which are lower. 

In an independent development in several languages of Mindanao in the Philippines, 
the animacy hierarchy also determines the order of clitics within a clitic cluster (Billings 
& Kaufman 2004; Kaufman 20102). For instance, in Maranao, a first person clitic always 
precedes a second person clitic and both first and second person clitics precede third 
person clitics, as seen in (69). 


?? Split proclitic/enclitic patterns in the languages of Sulawesi are argued by van den Berg (1996) to have de- 
veloped from a full proclitic pattern and by Himmelmann (1996) from a full enclitic paradigm. The history 
and typology of pronominal proclisis is further discussed by Wolff (1996); Mead (2002); Kikusawa (2003); 
Billings & Kaufman (2004). I believe the comparative evidence points very clearly to split-paradigms re- 
sulting from partial accretion rather than loss, besides the obvious preference of Occam's razor for such 
an account, but the details do not concern us here. 
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Table 3: Person marking in the patient voice (Kaufman 2014) 


Old Malay KaroBatak Gayo Clas. Malay Minangkabau 

1SG. ni-V-(y)ku ku-V ku-V ku-V den-V 

28G. (ni-V-mamu)  i-V-ogko i-V-ko kau-V ap-V 

3SG. ni-V-fia i-V-na i-V-é di-V-fia iño-V 
IPLEXCL ? i-V-kami kami-V kami-V kami-V 
IPLINCL  ni-V-(n)ta si-V kitó-V ^ kita-V kito-V 

2PL. ni-V-mamu i-V-kam i-V-kam | kamu-V kau-V 

3PL. ni-V-(n)da i-V-na i-V-é di-V-mereka ino-V 


(69) Maranao (Kaufman 2010a) 


a. M<iy>a-ilay=ako=ngka. 
<PRF>PV.NVOL-see=1S.NOM=2S.GEN 
“You saw me? 

b. M<iy>a-ilay=ngka=siran. 
<PRF>PV.NVOL-see=2S.GEN=3P.NOM 


“You saw them: 


Both of these phenomena offer support for the idea that there is an earliness principle at 
play which makes use of the definiteness/animacy hierarchy. A prediction of this anal- 
ysis, which is driven by linear precedence, is that no subject-predicate paradox of the 
type found in Tagalog should exist in Austronesian languages with basic SVO word or- 
der. This is because the argument which is higher on the definiteness/animacy hierarchy 
will both make for a more natural subject and naturally precede the predicate in such 
languages. This prediction is at least borne out in Indonesian. As seen in (70), even a sub- 
ject low on the animacy/definiteness hierarchy precedes the predicate in the unmarked 
word order. In a copular sentence such as that in (71), where a first person pronoun is 
in a predication relation with a definite NP, the pronouns still takes the canonical sub- 
ject position. Unlike Tagalog, it cannot felicitously be positioned in predicate position 


without special topic-focus intonation. 


(70) Indonesian 
Serigala bisa membunuh orang. 
wolf can av:kill person 


"Wolves can kill people’ 


(71) Indonesian 
a. Aku guru-nya. 
ıs  teacher-pEF 


'Tm the teacher? 
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b. #Guru-nya aku. 
teacher-DEF 1s 


Unfortunately, this topic has been left almost completely unexplored for other languages 
of Indonesia and so it is not yet possible to compare SVO languages of Indonesia with 
predicate-initial ones more broadly. The predictions of the current analysis are clear 
though that the unexpected inversions found in Tagalog should only occur in predicate- 
initial languages. 


6 Conclusion 


I have explored here several related aspects of predication and information structure in 
Austronesian languages. I began by arguing for a monoclausal analysis of apparent clefts 
in Philippine-type languages and tying this to the nominal nature of Philippine-type 
verbs. I then showed how true biclausal clefts emerge in Indonesian languages where the 
noun-verb contrast is more robust. In such languages, presupposed verbal material must 
be relativized before it can occupy subject position. While Indonesian relativizers come 
from varied sources (bleached nouns, interrogatives, pronouns in combination with the 
linker), it was shown that all patterns under examination fit neatly into a common syn- 
tactic template. Finally, I made an attempt at solving a paradox in the subject-predicate 
relation of Philippine-type languages. I argued that in addition to a canonical familiarity 
condition on subjects, there exists a linearity condition which requires that the part of 
a predication which is higher on the definiteness hierarchy precede the part which is 
lower. The prediction, which requires further exploration, is that SVO languages should 
not display these unexpected inversions. 

It perhaps deserves emphasizing here that syntacticians have been too hasty in posit- 
ing English-like constituency structures and lexical categories in the analysis of Aus- 
tronesian languages. Consequently, important differences between Philippine-type and 
non-Philippine-type Austronesian languages have been masked. By stepping back from 
these assumptions, we can begin to explore fundamental problems in the relation be- 
tween predication and information structure. Although the present work has only 
scratched the surface, it has hopefully opened a path for further research in how this 
relation varies across Austronesian languages. The resolution of this problem in Aus- 
tronesian may very well contribute to answering the philosophical questions around 
predication first put forth by Plato and Aristotle over two millennia ago and debated 
today. 


Abbreviations 

APPL applicative BEG begun aspect 
ART article COMP complementizer 
AV actor voice COP copula 
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DET determiner NVOL non-voluntary mood 
EMPH emphatic OBL oblique case 

EXT existential PM personal marker 
GEN genitive case PRF perfective aspect 
IMPRF imperfective aspect PST past tense 

INTR intransitive PV patient voice 

LNK linker QM question marker 
NEG negation RED reduplication 

NOM nominative case RELT relative marker 
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In this paper we investigate the influence of two information structure (IS) related aspects 
on the choice of voice form and sentence structure by Tagalog speakers. The first is the infor- 
mation status of argument referents. Tagalog is a multiple voice language, so almost every 
semantic argument in a sentence can be turned into the privileged syntactic argument (or 
subject) and be rendered salient. Information status of the undergoer has been argued to play 
an important role in voice and subject selection. The second IS-related aspect is the inherent 
structure of a discourse as determined by the implicit questions under discussion (QUDs) 
that are answered with each subsequent sentence in a text. The default sentence in Tagalog 
starts with a verb. Inversion constructions, i.e. sentences that start with an argument phrase 
instead of a verb, are described as motivated by information structure considerations such 
as focus-background or contrastive-topic-focus packaging. Based on a novel QUD approach, 
we will work out the discourse structure and at-issue contents of five short texts and show 
the important role of implicit QUDs and parallelisms on the choice of voice and constituent 
order. 


1 Tagalog: Voice, information structure and inversion 


11 Multiple voice system 


Tagalog is a verb-initial language known for its multiple voice system. For an incremental 
theme verb like /sulat/ (actor voice: sumulat; undergoer voice: sulatin) ‘to write’ we find 
a set of voice affixes on the verb that may pick out the semantic core arguments or one 
of the peripheral arguments as the privileged syntactic argument (PSA) - the subject - 
of the respective sentence. The PSA is marked by ang if it is a common noun and si if 
it is a personal name. The ang- or si-marked reference phrases (RPs, cf. Van Valin 2008) 
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tend to receive a definite or at least specific interpretation, and usually appear after the 
other core arguments, as shown in (1). Actor and undergoer voice sentences differ in that 
undergoers! expressed by common nouns preferably get an indefinite (and maybe even 
non-specific) reading in actor voice sentences, e.g. ng liham in (1a), while they preferably 
receive a definite or specific reading in undergoer voice sentences, e.g. ang liham in (1b). 
Note that the actor tends to receive a definite reading regardless of case marking. 


(1 /sulat/ ‘write’ 
a. um- actor voice 
Hindi s<um>ulat [ng liham]pyg kay Lisa [ang babae] Aer, 
NEG <AV.RLS>write GEN letter DAT Lisa NOM woman 


"Ihe woman did not write a(ny) letter/letters to Lisa? 


b. An: (incremental) theme voice 
Hindi s<in>ulat [ng babae]Acr kay Lisa [ang liham] yg. 
NEG <UV.RLS>write GEN woman DAT Lisa NOM letter 


"Ihe woman did not write the letter/a (certain) letter to Lisa? 


As mentioned above, it is not only the agent or the theme argument that may be singled 
out as salient with a verb like write, as the examples in (2) show. Note that if an argument 
other than the agent or the theme is chosen as PSA, the information status of the theme 
argument is unspecified. Sentence (2b) exemplifies that thematic role labels (here: theme) 
are sometimes not fine-grained enough to capture the semantic properties that the voice 
affixes pick out. We will not be concerned with the intricacies of so-called peripheral 
voice forms in this paper. The examples are only meant to show that the voice system 
serves first and foremost to make different semantic arguments in a sentence salient and 
that only one argument at a time may be marked as salient in a sentence. 


(2 a. -an: goal voice 
S<in>ulat-an ng babae ng lihamsi Lisa. 
<UV.RLS>Write-GV GEN woman GEN letter Nom Lisa 
"Ihe woman wrote Lisa a/the letter/letters: 
b. i- (transferred) theme voice 
I-s<in>ulat ng babae ang pangalan ni Lisa sg papel. 
TV-<UV.RLS>write GEN woman NOM name GEN Lisa DAT paper 


"Ihe woman wrote Lisa s name on (the) paper: 


1As it is common, we use the macro-role label undergoer to refer to various non-actor roles. 
"The (specific) indefinite reading of the undergoer can be enforced by inserting the numeral isa-ng ‘one’. 


(i) -in: (incremental) theme voice 
Hindi s« in» ulat ng babae kay Lisa ang isa-ng liham. 
NEG <UV.RLS>write GEN woman DAT Lisa NOM one-Lk letter 


"Ihe woman did not write a certain letter to Lisa’ 
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c. i-pan(g)-: instrument voice 
I-p<in>an-sulat ng babae kay Lisa ng liham ang lapis 
IV-APPLINsrR «UV.RLS»-Write GEN woman DAT Lisa GEN letter Nom pencil 
mo. 
your 


"Ihe woman wrote the/a letter/letters to Lisa with your pencil: 


So far there are no elaborate corpus studies exploring whether the definiteness/specifi- 
city associated with ang-marking is about uniqueness, identifiability, familiarity, given- 
ness, a certain kind of accessibility or other information-structurally relevant features. 
In this paper we are going to approach the problem from the other side. We will analyze 
the information structure and discourse structure of textual data and subsequently study 
the influence of information-structural constellations on voice selection. By constellation 
we mean, in particular, the focus-background structure of the entire clause within its dis- 
course context rather than isolated properties such as the givenness of a single argument. 
The goal is to make further contributions to the investigation of which discourse prop- 
erties exactly motivate voice and construction selection in a given text. To this end we 
will look at the results of a Tagalog translation and evaluation study aimed at getting 
speakers to produce and rate the same sentence in five contexts that differ with respect 
to the givenness of the arguments and the overall discourse structure. Before we turn to 
this study, however, a few more words need to be said about what has been found so far 
with respect to voice, constituent order and information structure coding in Tagalog. 


12 Preliminary assumptions on information status, event type, 
information structure and the prominence of the undergoer 


Philippine languages have been characterized as patient-prominent languages by Cena 
(1977), De Guzman (1992) and others, in the sense that undergoer voice has been said 
to be more frequent than actor voice in text counts, at least in transitive sentences (cf. 
Payne 1994; Sells 2001). Maclachlan (2002) among others argues that this observation 
does not hold unequivocally for Tagalog. Investigating several Philippine languages, No- 
lasco (2005), who proposes an ergative analysis of Tagalog, characterizes Philippine erga- 
tivity as speakers giving "the highest degree of prominence to the most affected entity" 
(ibid, p. 236), i.e. the entity viewed as most saliently affected by the event is said to re- 
ceive the absolutive? case (ang-marking). For the most part, the most affected argument 
in transitive scenarios is the undergoer, but may be the actor if the undergoer is less 
identifiable. So, basically, he also recurs to the degree of referentiality (i.e. the informa- 
tion status) of the undergoer as one of the essential factors for voice selection. The idea 
of ang-marking as prominence marking is taken up by Latrouite (2011), who suggests 
three levels of evaluation with respect to the relative prominence of an argument: (i) the 
referential level, (ii) the event-structural level, and (iii) the information-structural level. 


3Since we do not subscribe to the ergative analysis, we will gloss ang as nominative instead. 
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With respect to the referential level there are two possible scenarios: (i) one argu- 
ment is higher on the scale of referentiality than the other (given > familiar > unfamiliar 
> non-unique > non-specific) and, therefore, considered to be more prominent, or (ii) 
both are equally referential, but the referentiality of one of the arguments is less ex- 
pected and, therefore, this information is treated as more salient. Expectancy is based 
on the frequency of a pattern. As Primus (2012) and others have pointed out, actors tend 
to be referentially independent and definite, while undergoers tend to be referentially 
dependent on the actor and are less often definite. Therefore, we find cross-linguistically 
a tendency to develop marked morphosyntactic coding for definite undergoers. 

With respect to the level of event-structural prominence, we can distinguish under- 
goer- from actor-prominent and neutral verbs. Clearly undergoer-oriented verbs are 
change-of-state verbs like to scare or to kill that only provide information on the result 
state of the undergoer, but no information regarding the specific activity of the actor. 
Actor-oriented verbs are manner of action verbs (e.g. to devour) and certain stative verbs 
(e.g. to fear), and neutral verbs comprise punctual contact verbs like to hit. Schachter 
& Otanes (1972) provide examples showing that an undergoer-oriented verb like to kill 
or to scare cannot be used with actor voice in an unmarked, verb-initial sentence, as 
exemplified in (3a), but only with undergoer voice, as in (3b). 


(3) a. *T<um>akot sa mgababae si Lena. 
<AV.RLS>scare DAT PL woman NOM Lena 


Intended: “Lena scared the women? 
b. T-in-akot ni Lena ang isa-ng babae. 
<UV.RLS>scare GEN Lena NOM ONE-LK woman 


“Lena scared a woman: 


This suggests that the voice alternation we are interested in is not available for these 
verbs. Note, however, that there are examples of actor voice forms of these verbs, even 
with definite undergoers, as example (4b) shows. The sequence in (4) is taken from a 
blog discussing a movie. The movie is about sisters that suffer from different kinds of 
traumas. 


(4 a. Kay Angel, ang multoay ang pagiging — alipinsa alak at sigarilyo. 
DAT Angel, NOM ghost INV NOM developing slave par alcohol and cigarette 
‘For Angel, the ghost is the developing addiction to alcohol and cigarettes: 
b. [Kay Sylvia,] yg t<um>a-takot [sa kaniya]uc [ang pagpapalaglag sa 
DAT Sylvia <AV.RLS>IPFV.scare DAT her NOM abortion DAT 
kanya-ng magiging anak] Aer, 
her-LK developing child 
‘As for Sylvia, the abortion of her developing child scares (is haunting) her’ 
http://panitikanngpilipinas17.blogspot.de 


^Compare also the givenness hierarchy by Gundel et al. (1993) and the comprehensive overview on infor- 
mation status by Baumann & Riester (2012). 
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Latrouite (2011) observes that some kind of event-related markedness seems to be nec- 
essary to license actor voice of an inherently undergoer-oriented verb in a verb-initial 
clause. In her examples, the verb is always marked for imperfective aspect, inducing a 
habitual reading, and the actor argument is an inanimate causer rather than a volitional, 
controlling agent. She suggests that it is these divergent properties that license the use 
of the marked actor voice form. Note that in the example given in (4b), the sentence 
is also information-structurally marked, as the undergoer is a contrastive topic and ap- 
pears in the left periphery. The new, focal information in (4b) is, thus, provided by the 
actor phrase the abortion of her developing child. The example is couched in a series of 
contrasting sentences, one of which is given in (4a), listing the afflictions and problems 
the - previously introduced - protagonists are haunted by. Latrouite (2011) suggests that 
considerations with respect to the level of information-structural prominence outrank 
considerations regarding the other two levels (event type and information status of argu- 
ments). Based on examples like (4b) as well as examples discussed further down below, it 
is argued that information structure plays an important role with respect to voice selec- 
tion. However, there is no systematic corpus study to back-up the proposal as of yet and 
no detailed information-structural analysis of the data. This paper is meant to further 
investigate the claim that not only givenness and topicality, but also focality plays an 
important role for voice selection and, therefore, for PSA-marking. 

Note that for the two not very frequent peripheral voice forms, instrument voice and 
causative voice, Nuhn (2016) finds textual givenness of the PSA to be a prerequisite in his 
preliminary corpus study. It seems to make sense that undergoer arguments can only be 
realized as the PSA if they are prementioned. However, note that the licensing of actor 
voice in (4b) rather points to focality of the actor as the decisive factor, since it is chosen 
over a given undergoer. 

Latrouite (2016) finds that the textual givenness of undergoers is not sufficient to make 
speakers choose undergoer voice. For instance, in the target sentence (5c) the premen- 
tioned undergoer (experiencer) mga negosyante 'the negotiators' (see context in 5a) is 
not marked by ang; rather it is the inanimate actor argument the rallies which receives 
ang. As we have just pointed out, /takot/ (tumakot, takutin) ‘to scare’ is clearly undergoer- 
oriented. So not only based on the givenness of the undergoer but also based on event- 
structural prominence considerations, the undergoer argument would be expected to 
turn into the PSA. However, once again, we find actor voice and, once again, the sen- 
tence is information-structurally marked. It negates the truth of the previous sentence, 
and the follow-up sentence shows that the reason for the negotiators' fear is not the ral- 
lies but Erap.? It is therefore possible that it is the contrast between the actor arguments 
that licenses the chosen actor voice form. 


>Erap is the nickname of Joseph Estrada, at the time President of the Philippines. Note, furthermore, that 
the form yung is very often used instead of ang in spoken Tagalog and seems to find its way into written 
Tagalog as well. 
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(5) 


Actor voice despite prementioned undergoer: 


. Context (undergoer voice) providing rallies and negosyante: 


Sabat naman ni Executive Sec. Ronaldo Zamora, t<in>a-takot lang 
answer likewise GEN Executive Sec. Ronaldo Zamora <Uv>IPFV-scare just 
ng rallies ang mga negosyante. 
GEN rallies NOM PL  negotiators 


"According to Executive Sec. Ronaldo Zamora, the rallies just frighten the 
negotiators/businessmen: 


. Additional context: 


“And according to the postscript by Education Sec. Andrew Gonzales, it is 
forbidden for teachers to accompany the students to the rally, and a 
discussion regarding impeachment (from class) comes on top [...] What is 
this, martial law?' 


. Target sentence: 


Hindi totoo-ng t<um>a-takot [ng mga negosyante] yg [ang rallies] act. 
Not true-LK <AV.RLS>IPFV-scare GEN PL negotiator NOM rallies 


‘It is not true that the rallies frighten the negotiators/businessmen: 


. Continuation: 
Ang t<um>a-takot [sa kanila] yg, [yung pananatili ni Erap sa 
NOM <AV.RLS>IPFV-scare DAT them NOM staying GEN Erap DAT 
puwesto] act. 
spot 


"What scares them is Erap's staying on his spot? 
(Pilipino Star Ngayon, December 12, 2000, Mag - rally or tumahimik) 


The sentence in (6) shows an example pointing in a similar direction. The example is 
taken from the Tagalog translation of the Hunger Games by Suzanne Collins (2009). In 
the preceding paragraph the reader learns that the narrator is on her way to the woods 
to go hunting, which is illegal and may result in her getting killed. The narrator muses 
about the dangers she is putting herself into and how she cannot be open and truthful 
with anyone, not even her closest family members. In this context the following sentence 
is uttered. 


(6) 


Nag-hi-hintay [sa akin]uG sg gubat [ang nagiisa-ng tao-ng puwede 


AV.RLS-IPFV-Wait DAT 1SG DAT forest NOM alone-LK man-LK can 
ko-ng pa-kita-an ng tunay na ako -si | Gale]acr. 
1SG.GEN-LK CAUS-visible-LvV GEN true LK 156.NOM NOM Gale 


"Waiting for me in the woods is the only person I can show my true self to - 
Gale' (Yung Hunger Games, p. 4) 


English original: "In the woods waits the only person with whom I can be myself: Gale" 
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The actor phrase the only person [I can show my true self to]: Gale is newly introduced 
and, based on the relative clause, also starkly contrasted to the previously mentioned 
family members that the narrator cannot confide in. The undergoer argument is the 
narrator herself, so she is clearly given. Despite this fact, actor voice is chosen and the 
new, contrasted actor argument Gale is chosen as subject/ Examples like these lead 
Latrouite (2011) to the conclusion that the focality of the actor argument may result in 
actor voice selection, a pattern that cannot be predicted from the information status of 
the undergoer alone. It has been noted before that there is a default mapping between 
givenness/topicality and the macro-role actor on the one hand, and newness/focality 
and the macro-role undergoer on the other hand (e.g. Lambrecht 1994 among others), 
and that divergence from this default mapping often results in a specific morphosyntactic 
marking in the languages of the world, cf. Güldemann et al. (2015) on African languages.? 
Therefore, it is worthwhile exploring whether the sentences above are exceptions or 
whether they point to a more systematic pattern. 


13 Hypotheses on information structure and voice marking 


Given the asymmetry with respect to the default mapping of macro-roles and informa- 
tion-structural values, the characterization of information-structural prominence needs 
to be stated as follows: 


(7 Information-structural prominence (characterization): 
A core argument is IS-prominent if it has a non-default IS value (whereby the 
default value for actors is +topical and the default value for undergoers is «focal 
in a transitive sentence). 


Based on the actor focus examples above and cross-linguistic findings that information- 
structural prominence as characterized in (7) is often-times reflected morphosyntacti- 
cally in the languages of the world, we can formulate the hypothesis that information- 
structural prominence may indeed be a decisive factor for voice selection in Tagalog. 
If voice selection is influenced by prominence considerations and IS-prominence is an 
important factor, we expect the following correlation: 


(8) IS-influenced tendencies w.r.t. voice selection with two-place predicates: 


7Note that without further context the undergoer voice form would be preferred by speakers if the undergoer 
is highly referential (active, salient, given). 


(i)  H«in-intay ako ng isa-ng lalaki/ ni Gale. 
<UV.RLS>Wait 1SG.NOM GEN one-LK man GEN Gale 


“A man/Gale waited for me. 


5The reason for this default mapping, at least in narratives, may very well be that stories tend to center 
around a small number of protagonists that engage in various activities with objects and people they en- 
counter in the course of a story, so that the information packaging of a default sentence is Actor (topical) 
Verb (focal) Undergoer (focal). 
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a. If the undergoer is topical (given), undergoer voice is preferred. 


b. If the actor is focal (new), actor voice is preferred 


It is important to note that research on information structure of the past decades has 
made it reasonably clear that the conceptual pairs focal vs. new (as well as topical vs. 
given) are closely related yet not identical to each other, see e.g. Beaver & Velleman 
(2011) or Riester & Baumann (2013). This is why, in 82.1, we are going to switch to a 
question-based definition of focus. In the current section, however, and with regard to the 
examples introduced so far, a novelty-based definition of focus is sufficient and, perhaps, 
easier to comprehend. 

The tendencies stated in (8) render clear predictions if the actor and the undergoer do 
not differ in status with respect to information structure, i.e. if both are topical or both 
are focal. If both are topical, only the undergoer is considered information-structurally 
prominent based on (7). If both are focal, only the actor is considered prominent. As a 
first hypothesis to be checked we suggest the following: 


Hypothesis (i) 


If the actor is focal (contrary to the default mapping) and the undergoer is focal (in 
accordance with the default mapping), then actor voice is chosen: 


Actorr Undergoerg — Actor Voice (preliminary) 


The situation described in Hypothesis (i) that will come to mind at first is probably so- 
called broad focus although the hypothesis can, for instance, also be applied to situations 
in which the undergoer is a (focus-like) contrastive topic and the actor is a focus, as 
witnessed in example (4b). We will say more on contrastive topics in $1.4. A second 
hypothesis to be checked is: 


Hypothesis (ii) 


If the actor is topical (according to the default mapping) and the undergoer is topical 
(contrary to the default mapping), then undergoer voice is chosen: 


Actorr Undergoerr — Undergoer Voice 


Undergoer voice could then be expected to be the preferred choice in sentences in which 
the verb or some non-core argument is focal, but both actor and undergoer are old infor- 
mation. 

There are two more possible scenarios. The first scenario is the default mapping: actor 
(topical) — undergoer (focal). Neither argument is information-structurally prominent ac- 
cording to (7) in this scenario. Therefore, the choice of the respective voice form will have 
to be based on prominence considerations at a different level. The default for discourse- 
new undergoers is that they are indefinite, i.e. not referentially prominent. Based on the 
referential prominence considerations mentioned above, we expect actor voice to be fre- 
quent in this scenario, as shown, for instance, in (1a); cf. Primus (2012) on definiteness 
as a generally marked option for undergoers/objects. However, undergoer voice is also 
found in this constellation; compare (3b). Therefore, the only claim made in Hypothesis 
(iii) is that voice selection in this scenario involves information from a different level. 
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Hypothesis (iii) 


If the actor is topical (according to the default mapping) and the undergoer is focal 
(according to the default mapping), then both voice forms may be used. The final 
choice will depend on prominence considerations at a different level (i.e. referential 
prominence or event-structural prominence). 


Actorr Undergoerg — Actor Voice v Undergoer Voice 


The last possible scenario is a very marked one: both actor and undergoer are informa- 
tion-structurally prominent given the characterization in (7). In view of the fact that a 
given undergoer is not only IS-prominent but also prominent at the referential level, we 
might expect undergoer voice to be preferred over actor voice in this case. However, as 
we have already seen in examples (5d) and (6) above, a given undergoer does not neces- 
sarily enforce undergoer voice, so focality of the actor seems to outrank the topicality of 
the undergoer in quite a few instances. 


Hypothesis (iv) 
Actorr Undergoery — Actor voice > Undergoer Voice (preliminary) 


Morphological marking on the verb is not the only possibility to overtly express IS- 
prominence. Many languages use fronting and inversion constructions as well as prosody 
to mark IS-prominence, and so does Tagalog. Therefore, we can expect that speakers may 
be able to express the IS-prominence of both arguments, albeit with different means: for 
instance, voice marking for one of the arguments and inversion for the other one. This 
is indeed found, as discussed in $1.4. 

In order to factor out event-structural prominence and to highlight the role of informa- 
tion structure, Latrouite (2016) construed contexts for one and the same target sentence, 
containing the same verb and reference phrases. The contexts were meant to clearly 
determine the information status of the reference phrases and narrow down the set of 
possible implicit questions the target sentence could be an answer to. In this chapter, 
these contexts will be examined from a discourse-structural perspective. Before we turn 
to the study, however, we need to take a look at constituent order and information struc- 
ture in Tagalog and lay out our question-under-discussion approach which we utilize to 
determine the information structure of an utterance. 


14 Constituent order, prosody and inversion constructions 


In this section, we turn our attention to other means of information-structure marking 
than voice. Kaufman (2005) puts forward what he calls the "double focus" construction 
in (9b), in which only the (non-focal) actor argument? is signalled on the verb via voice 
morphology, while the two focal undergoer arguments in-situ are marked by prosodic 
stress. Given what has been just laid out in 81.3, we would not expect focal undergoers 


?(9b) shows that a pronominal (in contrast to a nominal) PSA does not occur sentence-finally and is not 
ang-marked. 
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to trigger undergoer voice, so the data do not yet pose a challenge. There have been 
no claims so far with respect to how contrast should be marked. Prosody (indicated by 
capital letters) seems to be a possible option. 


(9) “Double focus" (Kaufman 2005: 187ff.) 
a. Dapat ba-ng mag-bigay ng pera sa mga guro? 
should Q-LK Av-give GEN money DAT PL teacher 


“Should one give money to the teachers?’ 


b. Hinde, mag-bigay ka na lang [ng KENDI]p [sa mga BATA]r. 
NEG  AV-give  2sc.NoM only GEN candy | pATPL child 


‘No, (you) just give CANDY to the KIDS: 


Very often the phenomenon that some people call “double focus" is actually a combina- 
tion of a contrastive topic (Büring 2003) plus a (contrastive) focus. Contrastive topics 
have pragmatically a lot in common with foci (both of them give rise to alternatives, cf. 
Rooth 1992). We therefore expect that our hypotheses made with respect to focality (in 
particular, Hypothesis i) also cover contrastive topics. For instance, in (4b) we already 
saw actor-voice marking in combination with a contrastive topic and a focus. 

A rather special means of focus/contrast marking is shown in example (10b), by Kauf- 
man (2005: 194): the contrastive, genitive-marked actor appears in an extraordinary po- 
sition at the end of the sentence and receives prosodic stress, while the undergoer is 
given. Thus, according to Hypothesis (iv), we would expect actor voice. However, in 
(10b) undergoer voice is found, indicating that special word order choices seem to have 
an influence on voice selection as well. 


(10 a. K<in>a-usap ng bawatpropeso ang mga estudyante, di ba? 
<UV>COM-speak GEN each professor NOM PL student, NEG Q 
‘Each professor spoke with the students, right?’ 
b. Hindi. Hindi k<in>a-usap ang mga estudyante ni [PROPESOR 
NEG NEG <Uv>coM-speak NOM PL student GEN Prof. 
MARTINEZ] p. 
Martinez 


‘No. PROFESSOR MARTINEZ did NOT speak with the students 


Note that the actual information structure of (10b) is in fact a bit unclear. It is quite likely 
that, other than what is assumed in Kaufman (2005), the extraposed phrase ni Propesor 
Martinez is actually the contrastive topic, while the focus is on the negation. Since we 
have no other piece of evidence for this particular construction, we shall have nothing 
more to say about it at this point. 


True instances of “double focus”, which may also be characterized as two parts of a single, discontinu- 
ous complex focus (Krifka 1992), are rare in comparison with the rather frequent cT-F pairs. See also the 
discussion in Riester & Baumann (2013: 216). 
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It is also possible for a contrasted core argument to appear with nominative marking 
in sentence-initial position followed by ay, while the other focal element appears in-situ 
with prosodic stress, as shown in (11). This construction is one of three inversion con- 
structions mentioned in Schachter & Otanes (1972) and Van Valin & Latrouite (2015) and 
has also been linked to information packaging considerations. Inversion constructions 
are structures that diverge from the verb-initial default word order by starting off with a 
reference phrase in sentence-initial position. The three kinds of inversion constructions 
are the so-called ay-inversion construction, the ang-inversion construction and the ad- 
junct fronting construction. In our study below, the ay-inversion construction was the 
most frequent one. According to Schachter & Otanes (1972), ay-inverted elements are 
for the most part topical arguments, but may be used for contrast. In the example in (11), 
the ay-inverted element can easily be construed as a contrastive topic and the argument 
realized in-situ as the (contrastive) focus. 


(11) Contrastive topic — contrastive focus 
[Si  Peter]|cr ay nag-ba-basa [ng LIBRO mole, 
NOM Peter INV AV.RLS-IPFV-read GEN book 2sG.GEN 


"PETER is reading your BOOK? 


Note that, despite its nominative marking, the sentence-initial element followed by ay 
does not have to be the PSA of the verb; at least in undergoer voice sentences it can be 
the actor as well, see (12a). The particle ay may also be replaced by a pause here, sig- 
nalling the clause-external position of the sentence-initial element. There are no corpus 
studies yet, but one can speculate that the construction in (12a) may be one of the ways 
to code both the topicality of the undergoer (undergoer voice) and the focality of the 
actor (ay-inversion) at the same time. We therefore tentatively assume the information 
structure indicated on the example. Note that it is not possible to ay-invert an undergoer 
in combination with actor voice, see (12b). 


(12 a. Ay-inversion sentence with two Nom-marked phrases 


[Si Peter]pay b<in>asa [ang libro mo]r. 
NOM Peter INV <UV.RLS>read NOM book your 
‘PETER read your book: 


b. * Ang libro mo ay nag-basa si Peter. 
NOM book your INV «Av.RLs»read NOM Peter 


Intended: ‘Peter read the BOOK? 


Based on these descriptions we can add a fifth hypothesis that we wish to check. 


"Even with a resumptive pronoun, the sentence is considered ungrammatical by the consultants. 


(i) "Ang libro mo ay nag-ba-basa nito si Peter. 
NOM book your INV <AV.RLS>read this.GEN NOM Peter 


Intended: “The book, Peter is reading it’ 
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Hypothesis (v) 
Actorp tay Undergoerr — Undergoer Voice (preliminary) 


Meanwhile, based on the problems with extraposition in (10b) and with ay-inversion 
in (12a), we change Hypothesis (iv) to exclude ay-inversion and non-default constituent 
order. 


Hypothesis (iv) 
Undergoerr Actorp (only default constituent order) — Actor Voice (final) 


Finally, Hypothesis (i) will be stated more precisely to explicitly also cover ay-inversion 
and contrastive topics, i.e. examples like (11). 


Hypothesis (i) 
Actorp;cT tay Undergoerp — Actor Voice (final) 


In the following, we mention a few more information-structurally relevant syntactic 
properties of Tagalog, which, however, we will not investigate any further. It should be 
noted that the ay-inversion may also be used in connection with framesetters, such as a 
temporal or local adverbial phrase, as shown in (13). 


13) Kahapon ay nag-basa ng libro mo si Peter. 
p y nag £ 
yesterday INV AV.RLS-read GEN book your Nom Peter 


"Yesterday Peter read your book? 


If Tagalog speakers wish to put narrow focus on an obliquely marked argument or ad- 
junct, they are also said to recur to adjunct inversion (Kroeger 1993), in which the adjunct 
is moved to the sentence-initial position and attracts clitics (if present) that appear be- 
tween them and the verb. In (14) the pronoun siya is a clitic. 


(14) Adjunct/oblique inversion (“emphatic” inversion, Schachter & Otanes 1972, pp. 
496-98) 


a. T<um>awa X siya sa kaniya kahapon. 
<AV.RLS>laugh 3SG.NOM DAT 3SG.OBL yesterday 


‘She laughed at him yesterday: 


b. Kahapon siya t<um>awa sa kaniya. 
yesterday 3SG.NOM <AV.RLS>laugh DAT 3sG.OBL 


‘It was yesterday that she laughed at him? 


c. Sa kaniya siya t<um>awa kahapon. 
DAT 3SG.OBL 3SG.NOM <AV.RLS>laugh yesterday 


‘It was at him that she laughed yesterday? 
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Another inversion construction is the nominative or ang-inversion, in which the PSA 
is put in sentence-initial position followed by a nominalized (ang-marked) predicate 
phrase, yielding an equative structure like in (15b). This construction is often translated 
by means of an English cleft construction, but can be shown to have a slightly different 
distribution (Latrouite & Van Valin 2016) and a different syntactic structure (e.g. Nagaya 
2007). 


(15) Nominative (ang) inversion 


a. T«um-a-tawa siya. 
<AV.RLS>IPFV-laugh 3SG.NOM 
‘He was laughing: 

b. Siya ang t<um>a-tawa. 


3SG.NOM NOM <AV.RLS>IPFV-laugh 


"He was the one laughing: 


Except for two instances of ang-inversion (see (39) in $3.5 and (40) in 83.6), speakers did 
not reproduce the last two inversion constructions in the study presented in this paper, 
although Latrouite & Van Valin (2016) could show that for the Hunger Games corpus the 
ang-inversion is more frequently used than the English it-cleft. 


2 The pragmatics of information structure and discourse 
structure 


2.1 Information structure theory: Basic assumptions and terminology 


In this and the following section, we will define the information-structural concepts and 
terminology used in the case studies of 83 from a discourse perspective. At least two 
ways of describing information packaging have been proposed in the literature; the topic- 
comment structure and the focus-background structure. Lambrecht (1994) and others 
assume that the topic is the expression with respect to which the speaker aims to increase 
the hearer's knowledge, the comment being the part which provides this knowledge. 
The focus-background packaging, on the other hand, is about the distinction of “non- 
presupposed" vs. *presupposed" material. This definition is close in spirit, though not 
in terminology, to Riester (to appear) or Riester, Brunetti, et al. (n.d.), building on the 
tradition of Alternative Semantics (Rooth 1992) and theories of questions under discussion 
(QUD) (Büring 2003; Büring 2016; Beaver & Clark 2008; Roberts 2012). It is assumed that 


This usage of the term presupposed, which can apply to denotations of non-clausal expressions and which 
is closely related to the notions given or backgrounded, stems from Chomsky (1971) and Jackendoff (1972). 
It is, however, in conflict with the use of the term in formal-pragmatic approaches, e.g. Karttunen (1974); 
Stalnaker (1974), which essentially define presuppositions as (abstract) propositional entities assumed to 
be part of the common ground, i.e. not necessarily as given. Compare also discussions in Lambrecht (1994: 
150f.), Wagner (2012) or Riester & Piontek (2015: 237f.). For a contemporary picture of presupposition and 
other types of projective content see e.g. Tonhauser et al. (2013). 
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focus is the answer to the (current) QUD, while the background is the content used when 
formulating the QUD. Elements of the background that are not topical (here, interpreted 
as not referential, cf. Jacobs 2001) are sometimes called tail (cf. Vallduví & Engdahl 1996). 


(16) Question: What is Peter doing in the dark? 
Answer: Peter is dancing in the dark. 
[ Aboutness Topic] [Comment] [ Tail ] 
[ Background ] [Focus] [Background] 
[ Focus Domain - ] 


As for the information-structural markup, we choose the conventions demonstrated in 
(17), following Riester, Brunetti, et al. (n.d.): 


(17 Qí: What is Peter doing in the dark? 
> Ay: [[Peter]y is [dancing]p in the dark]-. 


(Q)uestions under discussion and their (A)nswers share the same index. The indentation 
(>) signals the fact that answers are subordinate to their QUD in discourse structure, i.e. 
they stand in a parent-child relation, as symbolized in Figure 1. 


Qı 


Ay 
Figure 1: Simple question-answer discourse 


The focus (F) is the part of the answer that corresponds to the question element of the 
QUD. QUDs which are implicit are enclosed in curly brackets, while overt questions 
are represented without brackets. The ~ symbol (Rooth 1992) indicates a focus domain, 
whose purpose it is to delimit the area comprising both the background and the focus. 
Focus domains “match” (Büring 2008) the QUD regarding their background, and they 
also allow for the easy identification of mutually contrastive assertions. All material 
inside the focus domain that is not focal is part of the background. There is no separate 
label for the background itself but we identify referential expressions in the background, 
e.g. Peter in example (16), as aboutness topics (T). 

A further category, which is not yet satisfactorily captured by the above grouping, 
is the contrastive topic (cr). Following Büring (2003), a contrastive topic is a hybrid 
category, which shares properties with both focus and background (or aboutness topic). 
We assume that contrastive topics can only occur in combination with a focus. They 
reflect a complex discourse strategy of the kind depicted in (18) or, more abstractly, in 
Figure 2. 


3 As for functional elements (e.g. the copula is, determiners, or prepositions), we leave it open when they 
should become part of the focus and when they should not. 
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(18) Qj: (Who is doing what in the dark?) 
>Q,1: {What is Peter doing in the dark?) 
>> A,,: [[Peter]cr is [dancing]p in the dark]-. 
> Qj)»: {What is Anna doing in the dark?) 
>> Àj»: [[Anna]cr is [eating]p in the dark]-. 


Q1 
FN 
Qi; Qi 


Aij A12 


Figure 2: Question-subquestion discourse 


The complex strategy consists in the fact that a question with two question elements 
(here: who, what) are answered by a sequence of answers to subquestions (indicated Q41, 
Q1 5). Now, the expressions Peter/Anna — which are backgrounded with respect to the 
subquestions but focal with respect to the main question Ou - are contrastive topics. 
Note that contrastive topics, as in (11), can function as aboutness topics (i.e. indicate 
a discourse referent) but they need not. Throughout the languages of world, it is the 
contrastive function of expressions that leads to stronger deviations from the default 
sentence realization than the aboutness function. In other words, it seems to be more 
important to signal that a change is expected or currently happening than to signal 
that there is a mere topical continuity. For that reason we expect contrastive topics to 
trigger more dramatic changes with regard to constituent order and prosody than (non- 
contrastive) aboutness topics. 


2.2 Discourse structure trees, QUDs and well-formedness conditions 


Recently, Riester, Brunetti, et al. (n.d.) (see also Reyle & Riester 2016, Riester to appear) 
have proposed a procedure for the identification of implicit questions under discussion 
(QUDs) in textual data. We will apply their methodology in the data analyses of Tagalog 
presented in 83. The method implements an insight, going back to at least Stutterheim 
& Klein (1989); van Kuppevelt (1995); Ginzburg (1996) and Roberts (2012), that the asser- 
tions made in a text can actually be thought of as answers to implicit questions, much like 
question-answer sequences in spoken dialogues. While in much of the previous work, 
QUDs have remained an object of theoretical investigation, Riester, Brunetti, et al. (n.d.) 
develop practical linguistic annotation guidelines, which are demonstrated on the ba- 
sis of French, German and English corpus data.!4 Since the information structure of an 


HSee also Riester & Shiohara (2018 [this volume]) for an application of the QUD-tree method to Sumbawa 
(Austronesian, Malayo-Polynesian). 
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assertion is definable relative to its QUD, the benefit of enhancing a text with pragmat- 
ically defined implicit QUDs is that we gain access to the information structure of each 
clausal unit. This provides us with potential access to a large amount of data, which 
can be used to study the morphosyntactic marking of information structure in any lan- 
guage of interest. The method is more flexible, faster and cheaper than collecting data 
by means of experimental techniques, and comes with the additional advantage that the 
data under investigation are potentially more natural than, for instance, artificially pro- 
duced question-answer responses. Without going into details, the reconstruction of the 
implicit QUDs of a text is enabled because it is constrained by a number of pragmatic 
principles derived and adapted from the literature on information structure; in particular 
Rooth (1992); Schwarzschild (1999); Büring (2003) and Büring (2008): 


* For any assertion A identified in a text, its immediately dominating question Q 
must be directed at one of its constituents (Q-A-CONGRUENCE); i.e. QUDs which 
do not target any expression in the assertion are forbidden. 


e The principle of Q-GIVENNEss says that Q can only consist of material that is 
salient at the point when A occurs, where salient means that the content is already 
active in the mind of the reader before A is processed. Typically, activation results 
from previous mention. In other words, material that is given in the discourse 
counts as salient and can be used to formulate Q. 


* The Q-GIVENNESS constraint is complemented by the principle MAxiMIZE-Q-ANA- 
PHORICITY, which says that all the content that is given in assertion A should be 
reflected in the QUD; i.e. A should have a maximal background and a minimal 
focus. 


For instance, in (19) the appropriate QUD for A, must be the one indicated by Q4, be- 
cause it is this question which contains only given material (rats) and maximizes the 
background of A,. By contrast, the reader is encouraged to check that other questions 
(e.g. Is stress unhealthy? What about cats? What happened?) violate one or several of the 
constraints mentioned above. 


(19) A»: The life of rats is stressful and dangerous. 
Qı: (What about rats?) 
> A, [[Ihey]T [get chased by cats]p]~. 


The final constraint, PARALLELISM, is designed to handle contrastive information in a text. 
When two assertions A' and A" share semantic content while their remaining parts are 
interpreted as alternatives, then there is a common QUD, in which the shared material 
is reflected. In specific cases, PARALLELISM will override Q-GrvENNEss, which means that 
the QUD of two (or more) parallel assertions may contain some discourse-new material 
if it is (semantically) contained in both assertions. As an example, consider the sequence 
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in (20). The information-structurally relevant QUD is Qj; (a subquestion of Q4) which 
includes the discourse-new (but semantically shared) material chase.” 


(20) Ag: The life of rats is stressful and dangerous. 
Qı: (What about rats?) 
>Q,,: {Who chases rats?) 
>> Aur [[Cats]p chase [rats]7]~ 
>> Ay: and [[dogs]r go after [them]+]- as well. 


Finally, example (21) introduces so-called non-at-issue (NAI) material (Potts 2005), which 
we define as linguistically optional expressions or, more precisely, as discourse-new ma- 
terial which is syntactically and semantically independent and which does not contribute 
in a direct manner to answering the current QUD. Typical NAI expressions are, for in- 
stance, appositions, non-restrictive relative clauses, speaker-oriented adverbs, eviden- 
tials or adjunct clauses like in (21). 


(21) Qo: {What do cats do?! 
> Ag: [When they are in the mood,]nay [[cats]y [chase big rats]g]-. 


We will simply ignore non-at-issue material when it occurs at the beginning or in the 
middle of a clause. Non-at-issue material at the end of a clause will be treated as a new, 
independent assertion. Finally, note that discourse markers (e.g. and, or, but, although) 
or discourse particles (also, as well, only, even) are not marked as Nat but simply left 
unannotated; compare A41” in (20). 


3 Case study “The unhappy rats’ 


3.1 QUD approach to “The unhappy rats’ 


Our case study is based on data elicited in Manila. Four speakers of Tagalog (three 
of which only speak Tagalog and no other Philippine language, and one who speaks 
Palawan and Tagalog) were asked to freely translate five short texts that differ in the 
givenness of the actor argument, the undergoer argument, the verb, and in the questions 
under discussion answered by the target sentence. The target sentences themselves vary 
slightly in their formulation in order to make the discourses sound more natural. In the 
following overview of the material we focus on the givenness of the arguments in the 
target sentences. The discourse structure of the texts will be worked out in detail in the 
next section. 

In Text 1, the actor cats is the discourse topic. The verb and the undergoer provide the 
new information. 


© This example demonstrates that it makes sense to distinguish between topical reference phrases, e.g. [rats], 
and larger, non-referential backgrounds that may contain more material than just the aboutness topic, e.g. 
the phrase [chase [rats] 7]. 
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1. Cats are silly creatures with nothing but nonsense on their minds. They climb up 
on curtains, bring home mice. Cats also chase and catch big rats, when they get a 
chance. Who wants to have a big rat in their house? 


In Text 2, the undergoer rats is the discourse topic, and there is a narrow contrastive 
focus on the actor cats. 


2. Itisnotonly wolves and foxes that threaten rats and catch them. Cats also catch rats 
and eat them afterwards. 


In Text 3, the undergoer rats is the discourse topic. The actor cats and their activity with 
respect to the undergoer is the new information. 


3. Rats live stressful and dangerous lives. The noise of the traffic makes them nervous 
and sick. Dogs chase them. And also (our domestic) cats catch and kill rats, when 
they get the chance. 


In Text 4, the discourse topic is the cruel laws of nature. All sentences are about predators 
catching certain animals. Thus, the verb combination chase and catch is given, while the 
actor cats and undergoer rats are new/contrastive. 


4. Lifein the wilderness is pretty cruel. Lions catch antelopes, sharks catch tunafish and 
happen to get caught and killed by humans themselves. Even here in the city these 
cruel laws of nature can be observed. (Our domestic) cats also chase and catch rats, 
and some also bring them home to continue playing with the bleeding creature. 


In Text 5, the discourse topic is the unhappiness and violence that the narrator witnesses. 
All follow-up sentences are all-new sentences that elaborate on what kind of violence 
the narrator observes. 


5. When I look out of the window, I see only unhappiness and violence. Dogs chase hens 
and make them lose their feathers. Old bitter women scream at children and make 
them cry. And also (our domestic) cats catch and kill innocent rats, when no one is 
looking. 


A PhD student and one of the authors presented the Tagalog translations to nine native 
speakers of Tagalog in Manila in a private setting. The consultants had different educa- 
tional backgrounds and were aged 20-63. They were not aware of the original English 
texts and were asked to read, evaluate, correct and improve the Tagalog versions of the 
five texts. They were also asked which translations they liked better. Interestingly, the 
participants never corrected sentence structure or voice selection, but only vocabulary 
choices. When asked explicitly about variants that could be found in different texts with 
respect to syntax and voice, they stated their preferences, but none of the translations 
was rejected as awkward. We therefore consider the translations we got as acceptable 
and natural Tagalog. 
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The nature of the texts led to translations with a lot of parallelisms and the occurrence 
of the particle rin 'also' in the target sentence Cats also catch rats, regardless of whether 
or not this particle had been given in the English original. As pointed out by Krifka (1998), 
particles like also tend to have an associated constituent, which is often the contrastive 
topic. 

The overall goal ofthe study was to find out how far the difference in textual givenness 
of the arguments would influence voice selection and construction choice in the target 
sentence. Recall that under the QUD approach (Riester, Brunetti, et al. n.d.) the crucial 
information-structural classes distinguished are focus, background (including aboutness 
topic), contrastive topic, and non-at-issue material, while a differentiation between new 
and contrastive focus is not made. 

In the following sections we will provide an analysis of the discourse structures and 
the implicit questions under discussion giving rise to comprehensive information-struc- 
tural analyses. Remember that the discourses shown in $3.2-83.6 differ from the ones 
introduced above in this section, since we present the free Tagalog translations of the 
original texts as well as their re-translations into English. Based on the analyses we can 
check whether our expectations regarding the coding of the target sentence and the 
actual coding choices by native speakers match. 


3.2 Text 1 - Intended: Actor (topical) Verb (focal) Undergoer (focal) 


Qo: (What is the 
way things are?) 


Qo: (What 
do different 
animals do?) 


Qo11: (What 
do cats do?) 


" an c 


SS? Few de Qoi: {What ex- 
SECH actly do they do?) 


without sense. 
LEA IE Cats also 


Aoiir: They climb A,,,,: and bring catch big mice (= 
up on curtains, home mice. rats) (when there 
is an opportunity). 


Figure 3: Discourse structure of Text 1 (free Tagalog formulation, translated 
back into English) 


265 


Anja Latrouite & Arndt Riester 


The first short text deals with the life of cats and the silly things they do. The QUD- 
tree analysis is shown in Figure 3 (target sentence in boldface). The actor cat was thus 
given, and intended was focus on the verb and the undergoer rats, i.e. [[cats]+ also [catch 
rats]p]~. The text was conceived in a way that no information-structural prominence in 
the sense of (7) for either argument needed to be marked morphosyntactically. Based 
on Hypothesis (iii) developed in 81.2, we predicted no special syntactic structure for the 
target sentence in this case, i.e. only the normal predicate-initial structure. But we ex- 
pected actor voice to be the preferred choice for this scenario because the undergoers are 
non-specific and the verbs are not undergoer-oriented. This is indeed what we found, as 
Aoi11”, discussed below in (24), shows. While we only give the translation of one con- 
sultant here, all other consultants chose the same form, i.e. a predicate-initial sentence 
with actor voice. 

According to Roberts (2012), any discourse addresses the so-called "Big Question" Qo: 
[What is the way things are?) Many actual discourses, however, start in medias res (Firbas 
1992), i.e. they use grammatical means (like presupposition triggers or non-default con- 
stituent order) to express that certain information should be accommodated (i.e. treated 
as if it were given), which boils down to answering a more specific subquestion (indi- 
cated in our example text by Qo; and Qo 111°). Since such grammatical means differ from 
language to language, we cannot be sure, at the outset, that our QUD-analysis for the 
initial sentence is correct. It is generally advisable not to draw any crucial conclusions 
from discourse-initial sentences (which we don't). In this example, we merely propose, 
without proving it, that the speaker has chosen to start a contrastive discourse strategy, 
analogous to the one in (18), $2.1, with the goal to provide information about different 
animals. We suggest therefore to analyse the expression cats in Ao44 of (22) as a con- 
trastive topic. Combinations of cr and r can be coded via ay-inversion in Tagalog, as 
shown in example (11) and suggested in Hypothesis (i), cf. Latrouite (2017). 


(22) Qo: {What is the way things are?) 
> Qoa: (What do different animals do?) 
>> Qo.11: (What do cats do?) 


>>> Au: [[Ang mga pusa]cr ay [g<um>a-gawa ng mga wala-ng 
NOMPL cat INV <AV.RLS>IPFV-do GEN PL NEG-LK 
kabuluha-ng bagay]r]~. 
sense-LK thing 


‘Cats do a lot of things without sense’ 


Assertion Ag is then elaborated by use of three partial answers to the question in Qo1.11 
in (23). The three parallel answers, hence, all carry focus on the predicate and undergoer. 
Given that the actor is topical and the undergoer is focal, i.e. given that the arguments 
have their IS-default values, Hypothesis (iii) delegates the voice selection to the level of 
referentiality. As the referent of the undergoer argument is non-specific, like in example 


Note that the sub-numbering convention indicates that there is an entailment relation between the ques- 
tions. For instance, every answer to Qo, is, at the same time, an answer to Qo, and, in turn, to Qo. 
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(1a) in the introduction, actor voice is expected to be preferred, and this is also what we 
find in the translations of all four consultants. Note that the sentences in (23) and (24) 
have default constituent order. The reason why the PSA is not in final position here is 
its occurrence as a (clitic) pronoun. 


(233) >>>Qo111: (What exactly do they do?} 
>>>> Au: [[Um-a-akyat]p [sila] [sa mga kurtina]p]~ 
Av-IPFV-climb 3PL.NOM DATPL curtain 
"Ihey climb up on curtains; 
>>>> Aoi: at [[nag-da-dala ng mgadaga sa bahay]r]-. 
and AV.RLS-IPFV-carry GEN PL mouse DAT house 


“and bring home mice: 


(24) Target sentence: 


>>>> A0111”: [Kung may pagkakataon,] yaz [[nang-hu-huli]g ^ rin [sila] 
if EXIST opportunity, AV.RLS-IPFV-catch also 3PL.NOM 
[ng malaki-ng daga]r]-. 
GEN big-LK mouse 


"Cats also catch big mice (= rats), when there is an opportunity: 


Note that, in the final assertion A," in (24) the preceding if sentence, labelled as not 
at issue, does not answer the question under discussion Qy, ; ;, but rather expresses the 
relevance condition for the truth of the sentence containing the actual at-issue content. 
The target sentence cats also catch rats has a parallel structure to the two preceding 
sentences that answer the same question under discussion. All sentences have the same 
basic structure, i.e. they are predicate-initial and show actor voice. Note that the focus- 
sensitive particle rin appears directly in the postverbal position in A 1 1". 


3.3 Text 2 - Intended: Narrow Actor Focus, i.e. Actor (focal) Verb 
(backgrounded) Undergoer (topical); Partly realized: Actor 
(contrastive topic) Verb (focus) Undergoer (topical) 


Qo: (What is the way things are?) 


Qo4: {Who catches rats?) 


Au: It is not only bears and 


Au: Cats catch them. 
wolves that threaten and catch rats. l 


Figure 4: Discourse structure of free formulation based on Text 2 
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The second short text!” (discourse structure given in Figure 4) was intended to yield a nar- 
row focus on the actor argument in the target sentence [[Cats]p also catch [rats] z]- and, 
thus, represent one of the marked constructions mentioned in Hypotheses (iv) and (v), 
ie. either Verbay Undergoerr Actorg or Actorg tay Verbuv Undergoery, because both 
arguments are information-structurally prominent according to (7). In order to force a 
context that would yield a narrow actor focus question as the natural implicit question 
under discussion for the target sentence, we provided a preceding sentence containing 
the focus sensitive particle om Jong ‘only’ with scope over those actor arguments that 
the actor cat in the target sentence was supposed to be contrasted with. The negation of 
the exhaustive particle in the first sentence was meant to be an indicator of the ensuing 
parallelism. In order to express the (negated) exhaustive narrow focus on the actor argu- 
ment expressed in the context sentence, the ang-inversion construction (compare (15b) 
in 81.4) was chosen by all Tagalog translators, as shown in (25). 


(25) Qo: {What is the way things are?) 
> Qoa: (Who catches rats?} 
>> Ao: [Hindi lang [ang mga oso at lobo]pang na-nakot at 
NEG only NOM PL bear and wolf NOM Av.RLs.scare and 
nag-hu-huli [ng mga daga]T]-. 
AV.RLS-IPFV-catch GEN PL rat 


“It is not only bears and wolves that threaten and catch rats. 


Subsequently, speakers did not continue with another ang-inversion but chose ay-inver- 
sion for the target sentence. As for ay-inverted narrow actor focus, Hypothesis (v) made 
us expect undergoer voice to mark the topicality of the undergoer. Indeed, two con- 
sultants offered this construction, which is given in (26).1Š The two other consultants 
suggested the construction in (27), i.e. ay-inversion of the actor combined with actor 
voice. 


(26) Target sentence: 
>> Agi: [[Ang mga pusa]p din ay h<in>u-huli [sila] r]~. 
NOM PL cat also INV «UV.RLs--IPFV-catch 3PL.NOM 


‘Cats also catch them? 


(27) Alternative realization of target sentence: 
>> Qı: {Who does what to rats?) 
>>>Q,, {What do cats do to rats?} 


V'For unknown reasons, the consultant changed the names of the animals from the original text. 

18 A predicate-initial construction as in (10b), with undergoer voice and the genitive-marked actor in sentence- 
final, i.e. a prosodically prominent position, was rejected as “not good" by all consultants, i.e. it was neither 
offered in the translations nor accepted as a possible option, when we asked the speakers in Manila about 
this. This adds to our suspicion that the construction in question does not express narrow contrastive focus. 
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>>>> Aij: [[Ang mga pusa]cr (rin) ay [nang-hu-huli at 
NOMPL cat (also) INV AV.RLS-IPFV-catch and 
nag-ka-kain]p [ng mga daga]7]~. 
AV.RLS-IPFV-eat GEN PL rat 


‘Cats (also) catch and eat rats.’ 


Note that the target sentence Ay; in (27), which has a different label than the one in 
(26) due to a very different discourse structure, contains a given verb catch and a new 
verb eat. It is quite likely that the way Text 2 was formulated had the (unintended) effect 
that the consultants construed the verb coordination as being contrasted against the 
previous combination threaten and catch. Hence, we assume, as indicated in (27), that a 
CT+F structure was chosen with focus on the verb complex. This also leads us to diversify 
our inventory of hypotheses once more and to add a slight change to Hypothesis (v), 
restricting it to backgrounded verbs. 


Hypothesis (v) 


Actorg tay Verbgg Undergoery — Undergoer Voice (final) 


Hypothesis (vi) 


Actorcy +ay Verbp Undergoerr — Actor Voice 


While it may seem ad hoc to formulate a new hypothesis merely on the basis of a some- 
what unclear example like (27), we will come across a very similar example in the next 
section, which seems to confirm that Hypothesis (vi) is on the right track. 

Most of the Tagalog target sentences for Text 2 did not contain a word for also, which 
was contained in the original English text; somehow consultants seemed to feel that the 
ay-construction already conveyed an additive focus reading. Only one consultant paid 
heed to the focus sensitive particle in her translation and placed rin ‘also’ right after 
the actor in one instantiation of (27). As we have seen above and see again here, the 
positioning of rin indicates its scope. If the verb is part of the scope, rin appears after 
verb. If only the actor is in its scope, it appears right after the actor. 


3.4 Text 3 — Intended: Actor (focal) Verb (focal) Undergoer (topical); 
Result: Actor (contrastive topic) Verb (focal) Undergoer (topical) 


Next, we wanted to find out whether a truly new verb would make a difference for the 
construction chosen, so we construed a text in which both the actor and the verb were 
discourse-new. This text (discourse structure shown in Figure 5) is about the life of rats, 
i.e. it is about the undergoer of the target sentence. In the sentences preceding the target 
sentence there is a clear focus on the events that affect rats. The translation provided by 
one of our consultants contains simple predicate-initial structures. This is in line with 
our expectation regarding predicate focus sentences. 
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(28) 


(29) 


(30) 
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Qo: (What 
is the 
world like?] 
| 
Qo: (What 
about the 
life of rats?) 


Ao: diel 0 95 _ 
of rats is Qi: (What 
stressful and about rats?} 


dangerous. HN cod RR 
O>: (By what Q3: {Who 


Ay: They are Ap: and 


d b ick are they does what 
made nervous become sick ` de sick?) to rats?} 
Ay: due to Q31: (What 
the noise of do cats do 
the traffic. to rats?) 


E e 


also our | Q4: (When do 
domestic cats they do this?) 
catch rats | 


A4: when 
they get 
the chance. 


Figure 5: Discourse structure based on Text 3 


Qo: {What is the world like?) 
>Qo1: (What about the life of rats?) 


>> Aoi: [[Delikado at  ma-stress]p [ang buhay ng mga daga]1]-. 
Dangerous and stressful wom life GEN PL rats 


“The life of rats is stressful and dangerous? 


>> Q: (What about rats?) 
>>> Ar: [[Ni-ni-nerbyos]p [sila] r]- 
UV.RLS-IPFV-nervous 3PL.NOM 
“They are made nervous’ 


>>> Ap: at [[nag-kakasakit] p]~ 
and Av.RLS-become.sick 


“and become sick; 


>>> Q2: {By what are they made sick?} 


8 The role of information structure for morphosyntactic choices in Tagalog 


>>>> Aar [[gawa ng ingayng trapik]r]-. 
done GEN noise GEN traffic 


“due to the noise of the traffic: 
Orig: “The noise of the traffic makes them nervous and sick? 


Interestingly, the consultant combines an undergoer voice verb and an actor voice verb 
in (29), realizing the affector-actor within an optional adjunct phrase, which we treat 
as a separate information unit at the end of the sentence. We therefore have narrow 
(contrastive) predicate focus on the verbs in (29) and a secondary focus on the actor 
phrase, which is not at issue with respect to Q, but at issue with respect to the sepa- 
rate subquestion Q% in (30). Note that the way the text was construed, we had expected 
to obtain a parallel structure between two actors (or affectors), the traffic and the cats. 
However, since the speaker chose to demote the traffic to a peripheral adjunct and used 
the intransitive verb to fall sick, a non-parallel discourse structure resulted. 

Similarly to the narrow-actor-focus context, i.e. example (27) in 83.3, the actor phrase 
our domestic cats in the target sentence in (31) was not realized at the end of the sentence 
(hence, Hypothesis (iv) is not applicable), but in sentence-initial position before the in- 
version marker ay and, once again, actor voice instead of undergoer voice was chosen, 
despite the given undergoer. 


(31) Target sentence: 
>>> Q3: {Who does what to rats?) 
>>>> Q31: {What do cats do to rats?) 
>>>>> A31: At [[ang ating mga alagang pusa]cr ay [h<um>u-huli] p 
and NoMour PL pet cat INV <AV.RLS>IPFV-catch 
rin [ng daga]7]~ 
also GEN rat 


“And also our domestic cats catch rats; 


(32) >>>>> Q4: {When do they do this?) 


> > > > > > A4: [[kapag may pagkakataon]r]-. 
if exist chance 


‘when they get the chance’ 


We cannot claim that the speakers simply gave the exact same structure to this utterance 
as to the narrow actor focus construction in (27), because the particle rin appears after 
the verb, not after the actor. Otherwise, however, the syntactic structure and voice are 
identical and, hence, in accordance with Hypothesis (vi). As witnessed above, the ay- 
inversion seems to express the first half of a contrastive discourse strategy, which is 
mirrored in the assumption of Qs and Q; , in (31). (It is only now that the speaker realizes 
the contrastive potential between the negative effects of, respectively, the traffic and 
the cats on the well-being of the rats.) We, therefore, analyze the answer in (31) as a 
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CT-F structure.!? This assumption is perhaps corroborated by a statement from one of 
the consultants who suggested the construction in As; and who explained her choice 
as follows: “Both parts of the sentence De As, and A4] are about the cats, so we are 
talking about cats now”, i.e. we have shifted the topic to the cats. 

It is interesting to note that two people offered the construction given in (31), but two 
other consultants chose a predicate-initial construction with undergoer voice and the 
focal genitive-marked actor in the sentence-final position, as shown in (33). 


(33) Alternative target sentence: 
>>> Qi; (What do cats do to rats?) 


>>>> Au: [[h<in>u-huli]p rin [ang mga daga]r [ng mga pusa]cq]~. 
<UV.RLS>IPFV-catch also NOM PL rat GENPL cat 


‘Also cats catch rats: 


This construction, in which the focal genitive-marked actor is in the marked, sentence- 
final position and the topical undergoer triggers undergoer voice, is an example compa- 
rable to (10b) in 81.4, in which the cr is clause-final. The speakers chose for the target 
sentence a structure identical to the sentence preceding it, i.e. they construed for the 
preceding sentence (the noise of the traffic made them nervous) an vv-sentence with traf- 
fic as genitive-marked actor argument, so that the two affectors traffic and cats were 
contrasted. 

The focus sensitive particle rin appears in both realizations, (31) and (33), right after 
the verb indicating that the verb is included in its scope. 


3.5 Text 4 - Intended: Actor (contrastive topic) Verb (background) 
Undergoer (focus) 


The fourth text (Figure 6) deals with life in the wilderness and different kinds of preda- 
tors that chase and catch animals and are chased themselves. Therefore, the text consists 
once again of a number of parallel sentences involving two variables (cr«r) that moti- 
vate the general question Who catches whom? and the specific questions Whom do lions 
catch? and Whom do sharks catch? Note that these parallel contrastive topic-focus sen- 
tences all show the same construction: ay-inversion and actor voice, as shown in (35), 
in accordance with Hypothesis (i). 


(34) Qo: {What is the way things are?} 
> Qoa: (What is life in the wilderness like?) 


>> Aoi: [[Sa kalikasan]cy ay [may mga hindi patas na batas]g]-. 
DAT wilderness INV exist PL neg fair Lk law 


‘In the wilderness there are unfair laws: 


PBut we should keep in mind that the assumption of a single focus spanning both the actor and the verb, 
hence a direct answer to question Q}, is also still an option. 
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Qo: (What 
is the way 
things are?) 


Qoi: (What 
is life in the 
wilderness 
like?) 
Aor: In the Q,: {Who 
wilderness 
catches 
there are 
a whom?) 
unfair laws. 
: {Wh 
Qui: {Whom Q. 2: {Whom E Rs nee Qi3: {Whom 
do lions do sharks lows of nalure do cats 
? ? ? 
catch?) catch?) be observed?) catch?) 
Q>: {As for 
sharks and ; 
A44: Lions catching, As: Eyen M 
caich A45: sharks what-else is our cities A43: Our cats 
antelopes catch tunafish. Tannerin this can be catch rats. 
pem ER ier observed, too. 
whom?) 
I 
Q31: {Whom 
are sharks 
getting 


caught by?) 


A31: They are 
furthermore 
getting caught 


by humans. 


Figure 6: Discourse structure based on Text 4 
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(35) >>Q,: {Who catches whom?) 
>>>Q,,: {Whom do lions catch?) 
>>>> Aj; [[Ang mga leon]cr ay nag-hu-huli [ng mga bayawak];]~ 
NOM PL lion INV AV.RLS-IPFv-catch GEN PL antelope 
‘Lions catch antelopes, 
>>> Qi»: {Whom do sharks catch? 


>>>> Aj: [[Ang mga pating]cry ay nang-hu-huli [ng mga tulingan]r]-. 
NOM PL shark INV AV.RLS-IPFV-catch GEN PL tunafish 
‘sharks catch tunafish: 


The next sentence, A» in (36), signals, both in its original formulation and in the Tagalog 
version, a change in discourse strategy: both the (nominative-marked) sharks and the 
activity of catching are kept up from A4» in (35) to A21 in (36). However, the sharks 
change their role from actor to undergoer, which is expressed by a contrastive change 
to undergoer voice. Furthermore, a new actor (humans) is introduced. We tentatively 
propose the - unusual - information structure shown in (36), in which the voice infix 
is assigned the function of a contrastive topic, while the actor humans is the focus. This 
also explains the slightly cumbersome formulation of Q> and Q21. Under this analysis, 
the ay-inverted pronoun is merely an aboutness topic; a constellation for which we have 
not formulated any hypothesis. 


(36) >>>> Q: {As for sharks and catching, what else is happening and with whom?} 
>>>>> Q21: {Whom are sharks getting caught by?) 


>>>>>> Aan: [[Sila]p ay h[<in>]cru-huli din [ng mga tao]r]-. 
3PL.NOM INV «UV--IPFV-catch also GEN PL man 
‘They are furthermore getting caught by humans’ 


Original: and they happen to also get caught by humans 
themselves: 


The speaker then jumps back up in the tree with a comment about cities, (37). What we 
see in A; is another ay-inversion expressing a narrow contrastive focus on a locative 
adjunct.?? 


(37) >>> Qs: {Where else can these laws of nature be observed?} 


>>>> A3: Kahit[[sa ating mgalungsod]p ay na-o-obersbahan din 
Even DATour-LK PL city INV UV-IPFV-observe also 
[ito] r.]- 
DEM.NOM 
‘Even in our cities this can be observed, too? 
Original: “Even here in the city these cruel laws of nature can be 
observed: 


20The verb observe is treated as salient here, although it has not been mentioned explicitly in the previous 
discourse. 
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The target sentence cats also catch rats is then realized by three speakers as an ay- 
inversion with actor voice, ie. the speaker is returning to the previously chosen dis- 
course strategy, expressed by Q4: (Who catches whom?) and the follow-up subquestion 


Q;5 in (38). 


(38) Target sentence: 
>>> Q13: {Whom do cats catch?) 


>>>> Aua: [[Ang nating mga pusa]cy ou nang-hu-huli [ng mga 
NOMour PL cat INV AV.RLS-IPFV-catch GEN PL 
daga]r]-- 


rat 


“Our cats catch rats? 


The fourth speaker suggested the ang-inversion in (39) - recall example (15) in 81.4 - thus 
either expressing the thought that the only ones who catch rats are cats - as indicated 
below - or, alternatively, that the only act of catching that takes place in the city is 
between cats and rats. The first option would mean that, in violation of the Q-GrvENNEss 
principle (82.2), the rats would have to be accommodated as given. The second solution 
would inevitably mean that the ang-inversion construction is not restricted to exhaustive 
narrow argument focus, but may also be used for (exhaustive) complex focus spanning 
both the actor and the undergoer. The data here are too limited to finally settle this 
question. However, given that we are not aware of languages in which cleft sentences 
are restricted to narrow argument marking, the second solution seems quite plausible. 


(39) Alternative target sentence: 
>>> Q4: {Who catches rats (in the city)?} 


>>>> Ay: [[Ang nating mga pusa]p ang nang-hu-huli [ng mga 
NOM our PL cat NOM <AV.RLS>IPFV-catch GEN PL 
daga]T]-- 
rat 


‘(Here) it is our cats that catch rats. 
Original sentence: “(Our domestic) cats also chase and catch rats: 


3.6 Text 5 — Intended: Sentence focus, i.e. Actor (focal) Verb (focal) 
Undergoer (focal); Partly realized: Actor (contrastive topic) Verb 
(focal) Undergoer (focal) 


Finally, we tried to construe a context for an all-new sentence, in order to elicit a clause- 
focus construction. In the text shown in Figure 7, a number of different scenes witnessed 
from a window were listed. Every sentence contained a new actor, a new verb and a new 
undergoer. 

In the first assertion Aoi in (40), the ang-inverted narrow (exhaustive) focus on panay- 
away at kaguluhan ‘constant fighting and turmoil’ is motivated by the focus-sensitive 
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Qo: (What is 
the world like?) 


Qo: (What 
do I see?) 
Ao: When I 
look out of the Q,: {Who 
window, I see does what?} 


only unhappiness 
and turmoil. pe dim c 
Qu: {What do the Q,,: (What do Qi4: {What do 
neighbours do?) the bears do?) the cats do?! 


A44: The neigh- 
bours fight with 
the children. 


À,5: Bears A,3: and cats 
persecute birds catch rats. 


Figure 7: Discourse structure based on Text 5 


particle only, i.e. the question Qo: {What do I see?} is accommodated. Moreover, the 
when-clause is interpreted as a relevance condition which is not at issue. 


(40) Qo: {What is the world like?) 
> Qo4: {What do I see?) 


>> Ag 1: [Pag t<um>i-tingin ako sa bintana,] yag [[panay-away 
when <AV.RLS>IPFV-look 156.NOM DAT window constant fight 
at kaguluhan]p lang ang na-ki-kita ko]-. 
and turmoil only NOM UV-IPFV-see 1SG.GEN 


‘When I look out of the window, I see only constant fighting and 
turmoil? 


This sentence is followed by yet another sequence of ay-inversions (i.e. CT-F structures), 
describing who (cT) is performing which act of violence: the focused elements are, there- 
fore, the Av-marked predicate and the undergoer. The construction fits the pattern de- 
scribed in Hypothesis (i).?! 


T What this example shows is that the construction of a text that consists of only new and unrelated sentences 
is in fact a very unnatural task. Instead, human interpreters will seize every opportunity to bring structure 
(here: cT-F pairs) - and therefore coherence - into what was originally intended to be an unstructured 
sequence of sentences. 
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(4) >> Qi: {Who does what?) 

>>> Qui: {What do the neighbours do?) 

>>>> Aj: [[Ang mga kapitbahay]cy ay [nag-a-away ng mga bata]r]-. 
NOM PL neighbour INV AV.RLS-IPFV-fight GEN PL child 

"Ihe neighbours fight with the children? 

>>> Qi»: (What do the bears do?) 

>>>> A12: [Lang mgaoso]cr ay [t<um>u-tusig ng mga birds]r]- 
NOM PL bear INV <AV.RLS>IPFV-persecute GEN PL bird 


"Bears persecute birds' 


(42) Target sentence: 
>>> Qijs: {What do the cats do?) 


>>>> Ais: at [[ang mgapusa]cr ay [nang-hu-huli ng mga daga]r]. 
and NOMPL cat INV AV.RLS-IPFV-catch GEN PL rat 


“and cats catch rats. 


Three speakers continued with a third ay-inverted cT-F assertion for the target sentence, 
as shown A5 of (42). Only one speaker chose an all-focus existential construction for 
the target sentence. 


(43) Alternative target sentence: 


>> Q5: {What other violent things are happening?! 


>>> Ax: [[May mga pusang h<um>a-habol at p<um>a-patayng mga 
exist PL cat-LK <AV>-IPFV-catch and <AV>-IPFV-kill GEN PL 
inosente-ng mga daga]r]-. 
innocent-LK PL rat 


"Ihere are cats chasing and killing innocent rats: 


4 Summary of findings, conclusions 


Table 1 sums up our findings with respect to the target-sentence translations discussed in 
the paper. Note that the other (non-target) sentences discussed in the previous sections 
are not mentioned in the table, although their analyses, too, are in accordance with the 
described hypotheses. 

In four of the texts (except for Text 1) the actor was focal (which includes contrastive 
topics) and thus IS-prominent, in two of them (Text 2 and 3) the undergoer was topi- 
cal/backgrounded and thus IS-prominent. In our examples, actor voice was the preferred 
choice. As Table 1 shows, with the exception of Text 1, the actor argument was always 
conceived of as a contrastive topic or a focus. Contrastive topics (and, otherwise, narrow 
foci) were typically ay-inverted (Texts 2-5). Notable exceptions were the extraposed cr 
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Table 1: Information structure and morphosyntactic choices (voice, constituent 
order and inversion): target realizations in five sample texts 


Structure Voice Example Hypothesis Text 
[V UG]p ACTr AV (24) (iii) 1 
ACTrE ay VBG UGr UV (26) (v) 2 
ACTcr ay Vr UGr AV (27) (vi) 

ACTcr ay Vr UGr AV (31) (vi) 3 
Vp UGr ACTcr UV (33) m 

ACTcr ay VpG UGr AV (38) (i) 4 
ACTrE ang VBG UG TF?) AV (39) = 

ACTcT ay [V UG]r AV (42) (i) 5 
[may ACT V UG]r AV (43) (i) 


in (33), the existential construction in (43) as well as the ang-inversion in (39), whose 
information-structural analysis remains uncertain. Undergoer topicality did not gener- 
ally lead to the selection of undergoer voice in the presence of a focal actor (Texts 2 and 
3). The current data lead us, thus, to the conclusion that focality of the actor is definitely 
more salient, and has a greater morphosyntactic effect, than topicality of the undergoer. 

In general, it has become clear that in order to describe the information-structural 
impact on voice selection, a more comprehensive approach is necessary rather than sim- 
ply considering givenness and newness of arguments. This paper? is the first to apply 
the new QUD-tree method described in Riester, Brunetti, et al. (n.d.) to Austronesian 
language data. It, therefore, demonstrates a completely new way of studying the infor- 
mation structure of a lesser-described language on the basis of textual corpus data. 

With respect to our hypotheses, we can specifically state the following based on our 
case study: Hypothesis (i) was confirmed in the data. 


Hypothesis (i) 


Actorpycp tay Undergoerp — Actor Voice 


However, we must add that, in our data, basic sentences structure was not chosen at 
all to encode two focused (or cr) core arguments. Rather we found the ay-inversion 
construction to be the predominant pattern (Texts 4 and 5). 


Hypothesis (ii) 


Actor; Undergoery — Undergoer Voice 


22See also the paper by Riester & Shiohara (2018 [this volume]). 
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Hypothesis (ii) was not truly investigated here, as basically all undergoer voice sentences 
found and discussed in the literature are of this type. Therefore, it was not the most 
interesting case to look at. Our target sentence did not appear in a narrow verb-focus or 
adjunct focus context, and apart from one sentence with a focal locative adjunct, (37) in 
Text 4, which indeed showed undergoer voice, we have nothing to add to this particular 
issue. 


Hypothesis (iii) 
Actory Undergoerg — decided at different level 


Since Hypothesis (iii) left it open whether a default information structure would lead to 
actor or undergoer voice, it was not actually challenged by our data. In the undergoer- 
verb-focus scenario (Text 1), the participants provided actor-voice sentences; more specif- 
ically, they chose the expected unmarked verb-initial word order. 


Hypothesis (iv) 
Undergoery Actorr (default order) — Actor Voice 


Hypothesis (iv) was discussed in $1.4 and eventually restricted to cases with default 
constituent order, thus excluding patterns with ay-inversion and extraposed actors. Re- 
member that narrow actor focus in our data was expressed by means of ay-inversion. 
Hence, Hypothesis (iv) did not apply to any of the cases found in the data. 

Based on the great variety of cases involving ay-inversion in our data, we formulated 
two more hypotheses regarding ay-inverted actors and clause-final topical undergoers. 


Hypothesis (v) 
Actorg tay Verbgg Undergoerr — Undergoer Voice 


Hypothesis (v) specifies one way of realizing narrow actor focus, hence, the situation 
intended in Text 2, which was found in sentence (26). 


Hypothesis (vi) 
Actorcy +ay Verbp Undergoerr — Actor Voice 


Finally, Hypothesis (vi) could be confirmed for all instances exhibiting the respective 
syntactic-pragmatic pattern, which, however, were produced by the consultants more or 
less by accident, since the cT-F structures they came up with deviated from the originally 
intended information-structural constellations. Notably, if we examine all examples that 
exhibit an ay-inverted contrastive topic and a focus on either the verb, the undergoer or 
both, we always witness actor voice. 

The lesson we learn from our approach is that, on the one hand, it is quite difficult to 
steer participants to produce a very specific information-structural pattern and obtain a 
natural discourse at the same time. On the other hand, the QUD-tree method allows us 
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to fruitfully analyze and interpret the actually produced data despite the deviations we 
gained. 

In the current study we have presented many new facts about information structure 
marking in Tagalog and, in particular, about its relation to voice and inversion. We were 
able to specify a number of detailed hypotheses, and data that match them. We must 
leave it to future work, though, to test these hypotheses on a bigger scale and to iden- 
tify a more general explanation why certain voice forms were chosen under specific 
information-structural constellations. 
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This paper describes the constituent ordering and other basic morphosyntactic properties of 
Sumbawa and their relation to information structure. Our study is based on conversational 
corpus data and makes use of a novel method of information-structural discourse analysis, 
which is based on the reconstruction of implicit questions under discussion (QUDs). 


1 Introduction 


Sumbawa (indigenous designation: Samawa; ISO-639-3 code: SMW) is a language spo- 
ken in the western part of Sumbawa Island, Indonesia. Sumbawa belongs to the Bali- 
Sasak-Sumbawa subgroup of the Malayo-Polynesian branch of the Austronesian lan- 
guage family (Adelaar 2005; Mbete 1990: 19). In this paper, we investigate the variation 
of constituent order in Sumbawa verbal clauses, using transcripts of a spoken conversa- 
tion. In particular, we are interested in the question of how morphosyntactic variation 
(in particular, pre- and postverbal argument realization as well as the occurrence of cli- 
tics) is correlated with information structure, an aspect of Sumbawa about which so far 
relatively little is known. 

In order to understand how information-structural variation is expressed, we use a 
novel method of textual analysis, developed in Reyle & Riester (2016); Riester et al. (in 
press), whose goal it is to identify for each elementary assertion the implicit question 
under discussion (QUD) (van Kuppevelt 1995; Büring 2003; Roberts 2012) to which the 
assertion provides an answer. Based on these QUDs the information structure of each 
assertion can be straightforwardly determined. The structure of this article is as follows: 
82 provides an outline of the verbal clause structure in Sumbawa, with a special focus on 
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syntactically possible constituent-order variation and its correlation with the presence 
or absence of a clitic pronoun on the predicate, whose dependence on information struc- 
ture we will explore in the subsequent sections. In 83, we will introduce the annotation 
method we will apply to the conversational data in order to determine the information 
structure of each utterance. $4 and 85 provide the result of the application: 84 gives a 
rough picture how the three categories focus, background, and contrastive topic shape the 
general constituent order of Sumbawa, while 85 focuses on the order of argument and 
predicate in relation to the presence or absence of the clitic pronoun. In 86, we will give 
a summary of the sections and evaluate the effect of the method. 


2 Constituent order and clitics in Sumbawa 


In this section, we discuss the morphosyntax of Sumbawa verbal clauses, with a spe- 
cial focus on syntactically possible constituent order variation and its correlation with 
the presence or absence of a clitic pronoun on the predicate, largely based on Shiohara 
(2013b,a), drawing on elicited data. 

In Sumbawa, as reported in Shiohara (2013a: 174), sentences can be formed using only 
a predicate which may carry clitics indicating, for instance, tense and subject/agent. One 
or several participants of the situation expressed by the predicate can be omitted when 
their identity is clear from the previous utterance or the utterance situation; see examples 


(1)-(6). 


(1) ka=kusteri’ (2) ka=muz=teri’ (3) kasteri’ 
PST-1sc-fall PST-2SsG-fall PST-fall 
‘T fell? "You fell? ‘He/She/They fell’ 
(4) ka=ku=inum (5 ka=mu=inum (6) ka=ya=inum 
psT=1sG=drink PST-2SG-drink PST-3-drink 
‘I drank it." "You drank it. 'He/She/TIhey drank 
it? 


The occurrence of the subject/agent clitic is determined by three factors: (i) (in-)transi- 
tivity of the main verb, (ii) the person of the single core intransitive participant (S) or 
transitive agent (A), and (iii) the overt realization or absence of the argument and its 
position. The information structure of a clause has an influence on (iii) and, therefore, 
on the occurrence of the clitic. 

Table 1 shows a list of the clitic and independent pronouns. There is no distinction be- 
tween singular and plural in the third person. As can be seen in examples (1)-(6), the first 
and second person clitic pronoun may (but need not) occur both on intransitive and on 
transitive verbs. The third person clitic ya exhibits an exceptional behavior in that it can 


1Something the referent of which is clear from the context. 
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Table 1: Sumbawa independent and clitic pronouns 


Person & Number Free pronoun Clitic pronoun 
1SG aku ku= 
1PL.INCL kita 
° tu= 
1PL.EXCL kami 
2SG kau mu= 
2PL nënë nënë= 
3 nya ya= 


only occur on transitive, e.g. (6), but not on intransitive verbs, e.g. (3). Sentence (7), in 
which ya co-occurs with an intransitive verb, is not accepted by the speakers.” 


(7) * ka=yasteri’ 
pst=3=fell 


Intended meaning: 'He/She/They fell? 


The person of the transitive patient (P) is never coded on the predicate, but can be ex- 
pressed by use of an independent pronoun (or, of course, a lexical NP). Pronouns and 
lexical NPs behave syntactically in the same way; see (8). 


(8) ya=tariaku/kau/nya/tau nan 
3=wait 1sG 2sG 3 person that 


‘She is waiting for me/you/him/that person’ 


The constituents for S, A, or P may principally occur before or after the predicate. How- 
ever, first and second person pronominal S arguments cannot occur after the predicate. 
Sentences (9) and (10) are not permitted by the speakers. 


(9) *ka-ku-teri' aku (10) *kasmu-teri' kau 
PsT-1sG-fall isG PST-2sG-fall 2sG 
Intended meaning: “I fell? Intended meaning: “You fell? 


By contrast, the S constituent for the third person can occur after the predicate, for 
instance, as the NP tódé nan ‘that child’ in sentence (11) or nya ‘he/she/they’ in (12). 


?Unlike some other Austronesian languages, such as Acehnese (Durie 1985) and Tukang Besi (Donohue 
1996), which exhibit so-called split-intransitivity, Sumbawa does not make a syntactic distinction between 
agentive intransitive verbs, which typically denote a volitional action, and non-agentive intransitive verbs, 
which typically denote a non-volitional situation. Thus, intransitive verbs such as barari ‘run’ or barnang 
‘swim’ that denote a volitional action behave in the same way as the non-volitional verb teri ‘fall’, which 
we use as a representative of intransitive verbs throughout this section. 
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(11) kasteri’ todé nan (12) ka=teri’ nya 
PST-fall child that PST-fall 3 
“That child fell’. 'He/She/They fell’. 


Sumbawa does not exhibit inflectional case marking. Yet, as for the post-predicate con- 
stituents, the case frame exhibits an ergative pattern, in that A occurs in a PP form with 
the preposition ling, as shown in examples (13)-(15), while an S constituent, as shown in 
examples (11) and (12) above, and P, as given in the NP kawa nan “that coffee' in sentences 
(13)-(15), occurs in the form of an NP? 


(13) ka-ku-inum kawa nan ling aku 
PsT-1sG-drink coffee that by 1sc 


‘I drank that coffee’ 


(14) ka=mu=inum kawa nan ling kau 
PST-2SG-drink coffee that by 2sG 


“You drank that coffee: 


(15) kazya-inum kawa nan ling nya Amin 
PST-3-drink coffee that by Mr. Amin 


“Amin drank that coffee: 


S, A and P may all occur before the predicate, as (16)-(21) show. All arguments are ex- 
pressed by an NP (i.e. without preposition) in this position. (We omit the second person, 
since it is analogous to the first.) 


(16) aku ka=teri’ 
isc PST-fall 


‘T fell? 


(17) tódé nan ka=teri’ 
child that psr=fall 


“That child fell? 


(18) aku ka=inum kawa nan 
1SG PST-drink coffee that 


‘I drank that coffee? 


3Strictly speaking, it is only for the third person referent that the ergative pattern is fully observed since, as 
we already showed in examples (9) and (10), a first or second person S may not occur in post-predicate po- 
sition. Another split in alignment, which we already mentioned in connection with (3) and (6), is observed 
with regard to clitic pronouns. The distribution of the first and second person clitic exhibits the accusative 
pattern in that it may code the person of S and A, but not that of P, while that of the the third person 
clitic exhibits the ergative pattern in that it may only code the person of A, not that of S and P. A split in 
alignment determined by person is commonly observed cross-linguistically, cf. Siewierska (2013). 
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(19) nya Amin ka-inum kawa nan 
Mr. Amin PsT-drink coffee that 


“Amin drank that coffee? 


(20) nya ka=inum kawa nan 
3  Psr-drink coffee that 


‘He/She/They drank that coffee’ 


(21 kawa nan ka=ya=inum ling nya Amin 
coffee that pst=3=drink by Mr. Amin 


‘Amin drank that coffee’ 


Whenever the S or A argument occur in pre-predicate position, there is no clitic pro- 
noun on the predicate. In other words, the clitic and the pre-predicate argument are 
in complementary distribution. Examples (22)-(24) are all ungrammatical. Note that the 
post-predicate PP indicating A obligatorily occurs with the clitic pronoun, as shown in 
examples (13)-(15) and (21) above. 


(22) * aku ka=kusteri’ 
1sG PST-1sG-fall 


Intended meaning: ‘I fell? 


(23) * aku ka=ku=inum kawa 
1SG pST=lsG=drink coffee 


(24) * nya Amin ka=ya=inum kawa 
Mr. Amin psr=3=drink coffee 


Only one NP may be fronted at a time; examples (25) and (26), with two fronted NPs, are 
not permitted by speakers. This is the only purely syntactic constraint on the relative 
order of verb and arguments. 


(25) * nya Amin kawa nan inum (A P V) 
Mr. Amin coffee that drink 


Intended meaning: “Amin is drinking that coffee: 


(26) * kawa nan nya Amin inum (P A V) 
coffee that Mr. Amin drink 


Intended meaning: “Amin is drinking that coffee: 


The following tables summarize the complex correlation between the type and location 
of the S/A argument on the one hand and the occurrence of a clitic pronoun on the 
verb on the other hand, for intransitive (Table 2) and transitive (Table 3) predicates. The 
symbols have the following meaning: - “no clitic pronoun”, + “a clitic pronoun occurs”, 


* 


“ungrammatical construction". The examples from above are indicated in brackets. 
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Table 2: Intransitive predicates: occurrence of clitics, subject type and subject 
position (examples indicated in brackets) 


1t person 2" person 3" person 


No overt subject + (1) + (2) - (3) 
Post-predicate subject NP * (9) * (10) - (11, 12) 
Post-predicate "subject PP" à i P 
Pre-predicate subject NP - (16) - - (17) 


Table 3: Transitive predicates: occurrence of clitics, agent type and agent posi- 


tion 
1st nd rd 
person 2 person 3 person 
No overt agent + (4) + (5) + (6) 
Post-predicate agent NP * Í j 
Post-predicate agent PP (ling) + (13) + (14) + (15, 21) 
Pre-predicate agent NP - (18) - - (19, 20) 


3 Information structure theory and questions under 
discussion 


In this section, we change from the grammatical description of Sumbawa to information 
structure theory, which, as we will show, will later help us account for the patterns de- 
scribed in the previous section. The central problem when studying the morphosyntactic 
realization of information structure in any language is the avoidance of circularity. Al- 
though variation in the constituent order of sentences often goes hand in hand with a 
variation of the discourse context in which these sentences occur, there is, at the surface, 
no syntactic focus-marking strategy that would universally apply in all situations to all 
languages. First of all, many languages have an in-situ focus as a default option, but some 
also have the possibility to explicitly realize focus clause-finally, by sorting constituents 
according to their information status or by applying extraposition. The opposite strategy, 
available in many languages, is the fronting or clefting of the focal constituent. Yet other 
languages exhibit no or very few morphosyntactic reflexes of information structure but 
instead mainly rely on intonation or prosodic phrasing to mark a focus that syntacti- 
cally remains in-situ. For overviews and comparisons see e.g. Zubizarreta (1998); Büring 
(2009); Skopeteas & Fanselow (2010); Güldemann et al. (2015); Féry & Ishihara (2016). In 
general, it is methodologically questionable to use morphosyntactic (or prosodic) indica- 
tors for the identification of information structure if, at the outset, little is known about 
information-structure marking in a particular language. 
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Information structure is a pragmatic phenomenon, i.e. it is usually described in terms 
of meaning categories which relate to context. This is what accounts for both its univer- 
sality and its somewhat enigmatic status. Throughout the literature, focus has - very in- 
consistently - been described as the answer to a question (Paul 1970 [1880]; Halliday 1967; 
Roberts 2012), as alternative-evoking (Rooth 1992), asserted (Lambrecht 1994), new (Hal- 
liday 1967; Schwarzschild 1999), identificational (Kiss 1998), exhaustive (van Rooij 2008) 
or contrastive (Katz & Selkirk 2011). A topic is usually defined - in a rather noncommittal 
way - as that which a sentence is “about” (Hockett 1958; Strawson 1964; Reinhart 1981; 
McNally 1998; Jacobs 2001; Krifka 2008), and the notion of contrastive topic has received 
a sophisticated characterization in terms of a speaker strategy (Büring 2003) to answer 
a complex question by working through a list of subquestions. Neither of these concepts 
easily connects to naturally occurring text or speech data. In order to study the informa- 
tion structure of language data gathered in fieldwork, people have, therefore, typically 
reverted to question-answer scenarios or other semi-spontaneous methods like the use of 
pictures or stories (cf. Skopeteas et al. 2006) to elicit controlled, information-structurally 
relevant material. 

In the current study, we will apply a new annotation method based on implicit 
questions under discussion (QUDs), cf. Stutterheim & Klein (1989); van Kuppevelt (1995); 
Roberts (2012); Beaver et al. (2017), which enables a pragmatic information-structure 
analysis of textual fieldwork data. The method, so far, has mainly been applied to French 
and German (e.g. Riester in press). Among the aforementioned definitions of focus, we 
adopt the one that takes focus to be the answer to the current question under discussion. 

When investigating dialogues, as we do in this study on Sumbawa, explicit questions, 
of course, allow us to study the morphosyntactic realization of the background-focus 
divide. Examples of overt (Q)uestion-(A)nswer pairs are provided in the following Sum- 
bawa examples (27). 


(27) Qç: ada ke cabe nana 
exist Q chili over.there 


"Was there chili over there (in Japan)?' 
Az: [ada [si]p [cabe] z]- 
exist PTC chili 


“There was chili’ 


(27) is an example of a polarity question, which triggers a yes- or no-answer. In this case 
the addressee answers by means of a full clause, containing a verum focus, which is 
realized on the particle si within the predicate. 

As for the information-structural categories and markup we assume the following 
definitions: the focus (F) of a sentence is that part which answers the question, whereas 


^See also Latrouite & Riester (2018 [this volume]) on the use of QUDs for the description of voice selection 
in Tagalog. 
?In this example, the question is signalled by a rising final intonation. 
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the remainder (the information already contained within the question itself) is the back- 
ground. Following Reinhart (1981); Jacobs (2001) or Krifka (2008) we, furthermore, as- 
sume that aboutness topics (T) are referential entities (terms) that are properly contained 
in (and, therefore, potentially smaller than) the background.Ó Following standard as- 
sumptions in Alternative Semantics (Rooth 1992; Büring 2008; 2016), answers (consisting 
of an obligatory focus and an optional background) are so-called focus domains, which 
are marked by the ~ symbol.’ Choice questions (also known as alternative questions) 
presented as disjunctions may trigger a constituent focus, which is why the answer in 
(3)Šis not a polarity focus clause. 


(28) Q»: no roa tu=satoan tris ke, atau ada waya tu-satoan 
NEG possible 1pL=ask always Q or exist time 1PL=ask 


‘Can we never ask, or is there a (proper) time to ask?’ 


A27: [[Ada waya]r tu=katoan]~ 
exist time ` 1PL-ask 


"Ihere is a time for asking: 


Since both alternatives contain the same verbal element tu=satoan '1PL=ask', this element 
is assumed to figure as the background of the answer. (We will say more on this below, 
e.g. example (35), when discussing parallel statements.) 

Not all assertions, even in dialogues, are made in response to explicit questions, and 
not all explicit questions in dialogues receive a direct answer. In both cases we need to 
develop an idea of how to reconstruct the intended question, lest a large part of the as- 
sertions of the discourse will remain unanalyzed. Following Roberts (2012) and earlier 
Work, e.g. by Stutterheim & Klein (1989) or van Kuppevelt (1995), we assume that every 
assertion in a text is actually the answer to a (typically implicit) question under discus- 
sion (QUD). Thus, if we manage to determine the QUD of an assertion, its information- 
structural analysis (focus, background, aboutness topics) will follow, as in the case of 
overt questions. 

The non-trivial part, of course, consists in the identification of the QUDs of predomi- 
nantly monological passages of text. A solution to the problem is described at length in 
Riester et al. (in press), and we will only shortly sketch it here. First, we segment the text 
into separate speech acts (which are predominantly assertions). Besides orthographic 
sentence boundaries we also split coordinated phrases into separate semantic assertions 
(under the assumption that a coordination is an effective way of communicating a se- 
ries of statements at one stroke). However, argument clauses will not be separated from 
their matrix clauses, since this would result in ungrammatical sentence fragments. For 


‘Usually, the “comment” is taken to be the complement of a topic. Since we see no use for such a category 
in our current work, we will ignore it. 

TIn Rooth (1992: 85ff.) ~ operators are used, among other purposes, to establish question-answer coherence: 
both questions and focused answers represent sets of alternatives, and the - operator identifies the question 
set as a proper subset of the focus alternatives. 

The alternation satoan-katoan seems due to dialectal variation. 
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instance, (29)-(32) is the result of the segmentation of a short paragraph, where a letter 
A is assigned to each separate assertion. 


(29) A: seandai parak ne ketakit nya 
if close ITJ scared 3 


‘If someone approaches, they get scared’ 


(30) A: minada tau geing ne ketakit nya sate yang de sate 
if exist person foreign rrj scared 3 want like REL want 


“If there is a foreigner, they are scared that he is after them like a kidnapper 
(lit. a wanter)’ 


(31) A: barari 
run 


"Ihey run away: 


(32) A: nan luk model nan 
that way character that 


"[hat's their character? 


As we can see, the conditional clauses in (29) and (30) are not split into parts because this 
would lead to ungrammaticality: neither of the clauses with the conjunctions seandai ‘if’ 
in (29) and min ‘if’ in (30) is grammatical in isolation. 

The next step consists in the identification of the QUDs. The determination of QUDs 
in non-parallel text passages follows three basic principles (for a justification see Riester 
et al. in press and references therein): 


(i) Q-A-CONGRUENCE requires that the QUD for an assertion targets an actual con- 
stituent of the assertion. (It is not permitted to choose a question which does not 
ask for at least one constituent of the answer.) 


(ii) Q-GIvENNESS says that implicit QUDs can only contain linguistic material that has 
been mentioned or is salient in the current discourse context. 


(iii) Finally, MAxiMIzE-Q-ANAPHORICITY determines that all given material that oc- 
curs in the assertion is in fact mentioned in the question, thereby producing a 
maximally cohesive discourse (and, at the same time, a focus that is as narrow as 
possible). 


For instance, in a small toy discourse Ao-A,, the implicit QUD Qj is the one shown in 
example (33)? and its tree representation in Figure 1. 


(33) Ao: Last Sunday, we had a picnic in the park. 
Qj: (What about the picnic?} 
> Ar: [[The picnic] [consisted of sandwiches]p.]~ 
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H 


Au Qj 


Ay 


Figure 1: Tree representation of example (33) 


In contrast to Q;, the questions in (34) are all invalid because each of them violates at 
least one of the QUD constraints. 


(34) a. Q: {Which park was it?) #Q-A-CONGRUENCE 
b. Q: {What about the sandwiches?} #Q-GIVENNESS 
c. Q: {What happened in the park?} #MAXIMIZE-Q-ANAPHORICITY 


The question in (34a) violates Q-A-CONGRUENCE because it cannot have A, as its answer. 
(34b) contains the discourse-new expression sandwiches, thereby violating O-GIVENNESS 
and, finally, (34c) and in fact all questions in (34) violate MAxiMIZE-Q-ANAPHORICITY, 
because neither of them contains the expression picnic, which is given in Aj, since it 
already occured in Ap. 

However, a violation of the principle of Q-GIVENNEss is acceptable in two cases. The 
first one is the beginning of a text. Here, the implicit QUD sometimes needs to contain 
linguistic material that is discourse-new (for details on this process of accommodation, 
see citetriear17b). 

The other acceptable violation is found in connection with parallelisms. When a QUD 
is answered by a series of structurally analogous assertions, the assertions are defined 
to be parallel. In that case, the corresponding QUD is allowed to contain the shared 
content, even though it may not be given in the preceding discourse. For example in 
(35) the verb ate is not given in the context, and Q-GIVENNEss would predict question 
Q>. However, the double occurrence of the verb within two parallel statements (marked 
as A» y and A» 1”) licenses the formulation of the more specific QUD Q>. Note that it is 
generally the case that a sub-QUD like this is always entailed by the more general one, i.e. 
every answer to Q», is at the same time an answer to Q3, even though Q» determines 
a narrower (here: object) focus. 


(35) Au: We hada lot of fun at our picnic. 
OQ»: {What did we do at the picnic?) (QUD licensed by Q-GIVENNEss) 
> O54: {What did we eat at the picnic?) (sub-QUD, def. by parallelism) 
> Ao»: [[We]r ate [delicious sandwiches]g]- 


> Ao: and [[we]T even ate [sashimi]g]-. 
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Figure 2: Tree representation of example (35) 


Riester et al. (in press), following Büring (2003), also postulate a more complex case 
of parallelism that includes contrastive topics (indexed as cr in the annotation). This 
type involves two (or more) assertions, which are contrasted against each other at two 
different positions. An example is given in (36). 


(36) Qs: {Who ate what at the picnic?) 
> O44: (What did my girlfriend eat?) 
> A31: [[My girlfriend]cT ate [delicious sandwiches]r]- 
> Os»: {What did I eat?! 
>> Az»: and [[I]cy even ate [sashimi]p]~. 


A31 A32 


Figure 3: Tree representation of example (36) 


Analogous to example (35) above, the two statements Qs ; and Q3 > in (36) are parallel, be- 
cause they both describe events of eating, and they answer the same QUD Q3. However, 
the difference to (35) is that the assertions in (36) vary in two positions (here: subject 
and object), and that Qs contains two interrogative pronouns instead of one. This is typ- 
ically a sign that Qs is not answered directly but broken down into subquestions about a 
list of salient individuals (here: my girlfriend and I). We shall assume that the elements 
of the answers which answer Qs but which are backgrounded with regard to one of the 
subquestions are so-called contrastive topics, whereas those elements which answer both 
the superquestion Qs and one of the subquestions are foci. 


?The indentation symbol (>) before A, indicates that A, is subordinate to Q, in the corresponding discourse 
tree, whereas A, and Q, are at the same level. On different tree representation formats see Riester (in press). 
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Example (37), cited from the Sumbawa conversation, involves crs. The conversation 
topic is about the difficulties Muslims face in connection with Japanese food. In this 
example, conditional clauses represent the topical options (Haiman 1978; Ebert et al. 2014) 
that the speakers are contrasting against each other.? 


(37) Speaker S: 


Qs: {What if we ate pork unwittingly or knowingly? 
> Q31: {What if we did it unwittingly?} 


> Asi: [o ba terang,] yag [[lamin nongka tu=sangaja] cr [terang] yar [no 
ITJ clear if NEG.PST 1PL=act.deliberately clear NEG 
sikuda]r]- 
problem 
‘But it is clear, if we didn't act deliberately, it is clear that it is not a 
problem’ 

Speaker I: 

> A314: ya [[lamin nongka tu=sangaja]cr [no sikuda]r]- 
yes if NEG.PST 1PL-act.deliberately NEG problem 


"Yes, if we didn't act deliberately, it's no problem 


> Q32: {What if we did it knowingly?) 


> A32: tapi [[lamin ka-tu-to]cr [kan]yą; [no roa]r [dean 
but if PST=1PL=know you.know NEG comfortable that 
nan]T]- 
that 


‘but if we knew, we are not comfortable with that? 


Finally, an assertion may contain phrases which neither form part of the QUD nor con- 
tribute to answering it, ie. they are neither backgrounded nor focused. We classify 
such phrases as non-at-issue (NAI) material, cf. Tonhauser et al. (2013); AnderBois et 
al. (2015). More specifically, we are talking about triggers of conventional implicatures 
(Potts 2005), which include appositions, non-restrictive relative clauses, parentheticals, 
speaker-oriented adverbs, evidentials as well as adjunct phrases/clauses, cf. Riester et al. 
(in press) for more details and precise definitions. A few of these, namely the speaker- 
or addressee-oriented phrases it is clear that or you know are contained in (37). Another 
example is shown in (38). 


(38) Q191: {As for work, what attitude do people in Japan have?) 


> Aj, [maklum]wa; [tau  [ana]cy [mega disiplin]g]- [sia] NA: 
you.know people over.there very disciplined 2PL.HON 


"You know, my friend, people over there are very disciplined’ 


10Note that speaker I. is repeating the statement made by speaker S., which is indicated by a ‘+’ on the index. 
More on this in example (43) below. 
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4 Corpus analysis 


4.1 The nature of conversational data 


The data we are examining is the transcript of a Sumbawa conversation titled Memory 
of Japan, which consists of approximately 1500 words. The conversation took place be- 
tween two Sumbawa speakers and Shiohara, one of the authors. The main speaker, I., 
had been working in Japan as a trainee, and is talking about his experiences to Speaker 
S., one of his friends, and to Shiohara, in reply to their questions. 

In the conversation, Speaker I. talks about Japanese people, culture and society. The 
nature of the conversation has a crucial influence on the syntactic and pragmatic prop- 
erties of the discourse; many clauses have a first person A or S. Since the speaker talks 
about a number of situations as if they were general facts rather than his personal ex- 
periences, he often uses the generic first person plural form for the A and S arguments, 
especially the clitic tu; many verbal clauses carry the clitic pronoun tu or the indepen- 
dent pronoun kita. (Compare Table 1 in §2.) There are only few examples of a first person 
singular A or S, in which the speaker talks about a specific episode he was part of. 


4.2 Focus identification 


The conversation contains about 100 assertions (or discourse units), which were analyzed 
according to the guidelines in Riester et al. (in press). The general tendency observed in 
the conversation is that a referent or a situation that has been mentioned in the directly 
preceding discourse tends not to be expressed in the current clause. Thus, entities in 
the background are often left unexpressed, and a substantial amount of assertions are 
only formed by the focal expression itself. Nevertheless, because of the implicit refer- 
ents, these are strictly speaking not all-focus assertions but rather elliptical clauses with 
narrow verb focus. A3g in example (39) is an example of an assertion that only consists 
of a focused expression. The silent A argument for this clause is co-referential with the 
S argument tau in Ass and nya in As. 


(39) Ass: Toyama ta mega tertutup tau 
Toyama this very unsocial person 


‘In Toyama people are very unsocial? 
Q36: (What about the people in Toyama?} 


> Age: [seandai parak]|wa; ne [[ketakit] p [nya] 7]~ 
for.example close ITJ scared 3 


‘For instance, if someone approaches, they get scared. 


[...] 


1A transcription and translation will be made available in Shiohara (in preparation). The video record- 
ing of the conversation, available on YouTube, was created in collaboration with the Sumbawa Litera- 
ture Association (Yayasan Bungaku Sumbawa), of which the two speakers and Shiohara are members, cf. 
https://www.youtube.com/watch?v-D8gOyhJi1VI 
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> Osg: {What do they do in such a case?) 


> A3g: [[barari];] e 
run 


‘They run away: 


A better example of a sentence focus, which shows that the default constituent order of 
intransitives is V S, is shown in A44 of (40). 


(40) Q: {What is the situation in winter?) 


> Aq: [[kukir sarea apa godong]r]- 
withered all what leaf 


‘All leaves are withered’ 


The assertions can be classified according to the syntactic range of their focus con- 
stituents. In what follows, we will show examples of different types of focus, and we 
will examine the relative order of focus, background and contrastive topic in clauses 
with a narrow (or mid-size) focus. Our main interest in this research is the relative order 
of a predicate and its argument(s), and the effect this has on the presence of a clitic. 


4.3 Information structure and constituent order 


In an argument focus clause, the argument always precedes the background. Two ex- 
amples of preverbal (patient) argument focus can be observed in A43” in (41) and A4 y in 
(42). 


(41) Qi: (What (else) can we not eat?) 


> Aa," atau[[apa de terkait ke bawinan]pno tu=bau kakan]- 
or what REL related with pig that NEG 1PL-can eat 


‘also we cannot eat whatever is related to the pig? 
(42) Q41: (How much salary do we get?) 


>Agy: [[balu-pulu-ribu yen perbulan]g tu-terima]- 
eighty thousand yen per.month 1Pr-receive 


‘we are getting eighty thousand yen per month’ 


12A fronted NP may be followed by the discourse particle si, as observed in elicited sentences like (i). The 
typical usage of si is the indication of a contrast or a situation that is counter to an expectation. This usage 
was not detected in the current conversation. 


(i) todé nan si kasteri 
child that PTC psr=fell 


“That child fell” 


The precise conditions under which si occurs other than as a polarity particle need to be addressed in future 
research. 
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This generalization holds throughout our data. The only apparent exception occurs in 
the form of a repetition. The three assertions in (43) below convey the same assertion 
and are, therefore, labelled as A554, Ags 4, and A554,,. 


(43) Qo51: {Which privacy-related issues - said the friend - can you not ask a Japanese 
person about?) 


> A251: [umpama] nay [[umir.] ]- 
forexample age 


“For example, about the age; 


> Ags 44: [[umir nan]p na coba-coba katoan]~ [ling] wA; 
age that NEG.IMP try ask HEARSAY 


‘don’t try to ask about the age, he said? 


[...] 


> Aosi44: No roa tu=katoan umir tau jepang ta, 
NEG possible 1pL=ask age person Japan this 


‘It is not possible that we ask about the age of a Japanese person’ 


Repetitions, in some sense, defy the rules of proper information transfer because, from 
a logical point of view, a speaker should not assert something which is already implied 
by the common ground - which would amount to making an all-given (or focus-less) 
assertion. Intuitively however, speakers repeat themselves precisely because they are 
not confident that their interlocutor has already accepted their previous statement. It is, 
therefore, reasonable to assume that, under normal circumstances, a structurally iden- 
tical repetition has the same information structure as its previous mention. However, 
in example (43), the focus argument umir 'age' first occurs in an elliptical environment 
(A551). it then precedes the background in the second statement A554, (an imperative), 
while in the third, assertive, statement A55;,, it suddenly follows the verb. We believe, 
since this is the only case of a potential focus argument following a backgrounded verbal 
predicate, that, by means of the repetition, the speaker frees himself from the pragmatic 
requirements of the discourse context; thereby performing a kind of "context reset" with 
regard to the contents of his statement. It is, therefore, possible that the actual informa- 
tion structure of Ass... is that of an all-new assertion, in which the patient argument 
follows the verb, as shown below. 


(44) Q: {What is the way things are?) 


> A: [[No roa tu-katoan umir tau jepang ta,]r]- 
NEG possible 1pL=ask age person Japan this 


‘It is not possible that we ask the age of a Japanese person’ 


When an argument is backgrounded it follows the predicate in most cases, as shown in 
in Aan, (a backgrounded subject argument following the focused stative verb mogang 
‘to be light’). Another example of the same kind is As / (39), discussed above. 
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(45 Q>i: (How was the work like?! 


[...] 


> Azry»: [[mogang mogang]r [boat] 7]~ 
light light work 


"Ihe work is easy, so easy: 


4.4 Contrastive topics 


Apart from the conditional clauses previously discussed in (37), 83, we do not find in our 
data any paramount examples of overtly realized pairs of contrastive topics, i.e. those 
marked on referring argument expressions. What we do find is that, in a few cases, an 
argument that was given somewhere in the earlier discourse but didn't play a role in the 
preceding sentence, occurs in pre-predicate position. The question is now whether these 
entities should be assigned to the background (therefore representing aboutness topics) 
or not. For instance, in A194 of (38), repeated in (46), the phrase people over there refers 
to a group given in a distant part of the discourse context. Obviously, the speaker is not 
using this phrase as a simple aboutness topic. Instead, what he does is implicitly contrast 
the referent with a different group of people (presumably, those at home). Since we need 
to account for the topic change anyway, we make use of the structure introduced in 83 in 
connection with contrastive topics, i.e. a question-subquestion tree of which, however, 
only the first half is overtly realized, compare Riester et al. (in press: Sect. 5.2). Since 
there is no essential discourse-structural difference between such a “new” topic and the 
contrastive topics introduced above, we also tentatively assign (the contrastive element 
of) the referring expression the label cr, in the style of Büring (2003: 526). 


(46) Qi: {As for work, what attitude do different people have?! 
> Qj91: {As for work, what attitude do people in Japan have?) 
> Aq: [..] tau  [ana]ey [mega disiplin]z]- [...] 
people over.there very disciplined 


"People over there are very disciplined? 


In the very similar case of Aus) / (47), the demonstrative denan ‘that’ is introduced by 
the form lamin, tentatively glossed ‘as for’ here, and refers to ‘the time to get up’, which 
is the P argument of the predicate determine. What the speaker wants to express is that 
this particular issue was not regulated, although other things were, like e.g. the work- 
ing hours. Again, we represent this in the form of a contrastive / non-continuous topic, 
which is merely implicitly contrastive. 


(47) Speaker S.: 
Ou: (Which things were regulated?) 
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> Qi71: [.] waya mleng tunung apa waya, ada ke ya-tentukan kalis-kalis 
time getup sleep whattime existo 3-determine from 
ana e 
over.there ITJ 


‘Is there a regulation when to get up from sleep over there?’ 


Speaker L: 


> A171: [Lamin [denan] cr [no=soka] p ya=tentukan, ]~ 
as.for that NEG.PST 3=determine 


‘As for that, they didn’t prescribe it. 


Note that examples like these represent a challenge to the rules currently formulated in 
the guidelines of Riester et al. (in press), since the lack of an overt alternative makes the 
identification of such “implicitly contrastive topics” very difficult. Obviously, the QUD- 
tree framework needs to be enhanced with clearer rules concerning the identification of 
such non-continuous, implicitly contrastive topics. Kroeger (2004: 136) provides a brief 
and clear discussion on a distinction of topic types similar to those observed here. He 
discusses that “(T)he topic of a sentence, when it is the same as the topic of the preceding 
sentence, needs no special marking. It can often be referred to with an unstressed pro- 
noun, an agreement marker (as in the ‘pro-drop’ languages), or even by being omitted 
entirely (‘zero anaphora’). But in certain contexts the topic may require more elaborate 
marking. This may happen when there is a change in topic, a contrast between one 
topic and another, or a choice among several available topics”. See also Lambrecht (1994: 
117ff.) for a detailed discussion on the distinctions among various types of topics, and 
Centering Theory (Walker et al. 1998) for a closely related discourse approach to topics 
(or “backward-looking centers”). 


4.5 Other focus types found in the data 


In this section, we mention a few other focus types found in our data, before returning 
to our main issues defined in §2, constituent order and clitics. As observed in example 
(27), §4.3, repeated here as (48), a verum focus is realized on the particle si. 


(48) Qv: ada ke cabe nana 
exist Q chili over.there 


"Was there chili over there (in Japan)?’ 


> Az: [ada [si] [cabe] ]- 
exist PTC chili 


“There was chili’ 


The negative counterpart, a falsum focus, can be seen in Aj71 of example (47) in the 
previous section or, using a slightly different negative element, in A42 of (49). In both 
cases focus is realized on the negative element, which is followed by the predicate. 
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(49) Qu: nka kadu gita cabe pang Jepang ana? 
NEG.PST ever see chiliin Japan over.there 


“Have you never seen chili over there in Japan?’ 
> Aj: [[nka]p kadu gita [puin cabe]r]- 
NEG.PST ever see tree chili 


‘I have never seen a chili tree: 


In the case of focus on an NP modifier, the modifier retains its canonical (post-nominal) 
position. 


(50) Qs {What kind of chili was there?} 
> Qs 1: {What about a lot of the chili?) 
> Ag, cuma[de ka [peno]cr cabe ne, cabe [instan]r]- 
only mELPsr many chili rty chili instant 


Tt is just that a lot of chili was instant chili? 


Compare this to assertion A33 in example (51), in which a modified phrase with the same 
word order occurs in an adjunct clause (classified as non-at-issue) that is all-new and, 
hence, not narrowly focused. 


(51) Q33: {What do we have to do?) 


> Ass: [min sate parak ke tau  sowai gera]war [...] [harus 
if want approach with person woman beautiful have.to 
[tu-tahan ` rasa-ate]p]- 
1PL-control emotion 


‘If we want to approach a beautiful woman, we need to control our 
emotions: 


The situation is different with numerals. In Au of example (50), the focal numeral lima ‘five’ precedes the 
head noun. 


G) (Speaker talking to himself.) 


Qız: jadi ada pida kamar 
then exist how.many room 


“Then, how many rooms were there?’ 


> Aj: [[lima]g kamar]~ [amen no salah] az 
five room if | NEG wrong 


“There were five rooms, if I am not wrong; 
As Shiohara (2014) suggests, the position of a numeral relative to its head noun varies according to its 


focal status. If a numeral is not in focus, it follows the head noun, like a modifier does, though the data we 
examined in this article do not include any such example. 


302 


9 Information structure in Sumbawa: A QUD analysis 


5 Grammatical relations, person, agreement and 
constituent order 


5.1 Intransitives 


In 82, we introduced the basic morphosyntactic rules of Sumbawa - based on elicited 
data - in which A, S and P are realized. In this section, we will look at how information 
structure, observed in our annotated data, can explain some of the variation found. 

According to the rules postulated in Table 2 in 82, a first (or second) person S is ex- 
pected to be realized either as a pronominal clitic or as a pre-predicate argument. The 
first case is exemplified by (52). The tendency that an entity that was mentioned in the 
directly preceding utterance or that is generally salient is not overtly expressed applies 
to all types of arguments. Thus, when the referent of S or A is topical, it is expected to be 
realized (only) as a clitic pronoun within the predicate in first and second person. This 
is the case with the first person plural subject in (52). 


(52) Q33: {What do we have to do?) 


> A33»: [harus [tu-bersabar benar]p]~ [ampa] nar 
have.to iPL-patient really ITJ 


"We need to be really patient: 


An unexpected ct-case, which runs counter to the predictions, is A931 in example (53), 
in which S is doubly marked. 


(53) Q23: (Who does what in the morning?) 
> Q231: (What do we (Muslims) do in the morning?) 


> Ao31: [kan]ya; [[kita]cr [tu=sembayang-subu] [dunung,]r]~ 
you.know  iPLINCL 1PL=do.early-morning-prayer before 


‘As you know, we first do the early morning prayer: 


As mentioned in §2, this construction was not accepted by participants in an elicitation 
task when asked for their grammatical judgment, and was therefore classified as non- 
canonical, if not ungrammatical. In many languages, the complementary distribution of 
clitic and argument has made way for a more canonical agreement system, in which the 
two forms co-occur. We may see the phenomenon in Sumbawa in an ongoing process 
of a grammatical change.“ At the present stage of our research, though, we do not have 
enough data to say more about this. 

The third person S is expected to be realized as either a pre-predicate argument, post- 
predicate argument, or left unexpressed, and all three types are observed in the con- 
versation. Examples of post-predicate realization already discussed are the broad-focus 


Hin some Austronesian languages spoken in eastern Indonesia, such as Kambera (Klamer 1998: 69-70) and 
Bima (Satyawati 2009: 92), this type of co-occurrence of clitic pronoun and argument is permitted. 
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example (40) as well as the narrow-verb-focus clauses A36 / (39) and (45). A further ex- 
ample is (54). Zero marking was shown in Ass / (39) and can also be witnessed in (55). Fi- 
nally, pre-predicate subject realization was demonstrated in connection with contrastive- 
topichood in (46). As expected, there is never a clitic on any of these predicates. 


(54) Quad: {How did the person who came look like?) 


> Ana: [[barangkali, lenge benar]r [ade datang ta] T]- 
maybe ugly really REL come this 


"Maybe the person who came was really ugly: 
(55) Qz: {How was it to start working at 7:15 a.m.?} 


> A214: [[disiplin benar]r]- 
disciplined really 


‘It was really disciplined? 


5.2 Transitives 


According to Table 3, §2, the A argument can be realized as either a pre-predicate argu- 
ment, a post-predicate PP, or simply as a pronominal clitic. The post-predicate PP may 
co-occur with a clitic, while a pre-predicate argument may not. However, post-predicate 
PPs were not found in our data. 

The two remaining options, pre-predicate argument and clitic, are expected to occur in 
complementary distribution. In our data, clitics are particularly frequent in combination 
with a first person A, as shown, for instance, in (51) above. Another example, which 
shows a clitic on the predicate of the main clause and a preverbal pronoun (no clitic) in 
the embedded clause, is given in (56). 


(56) Qs: {What should we do towards the Japanese people concerning the consump- 
tion of pork?} 


DIt can be observed that the post-predicate PP argument typically occurs in narrative sentences, indicating 
a series of events in which several people take turns at being the agent. In such cases, an agent, despite 
being salient in the preceding utterance, needs to be expressed for the sake of disambiguation. (i) is a typical 
example cited from a folktale in Jonker (1934: 214). 


(i) ya=beang mo ling Salam lako guru; meng ka=ya=kakan mo ling guru: e, 
3=give  PrCby Salam to teacher; when Psr-3-eat prcby teacher ITJ 
nyaman tai asu ode” 
delicious shit dog small 


“Salam gave (it) to the teacher. When the teacher ate (it), (he said): “Ah, the shit of the small dog 


> 


is delicious. 
In our present conversation data the speaker is mainly talking about his own experiences. There are no 


comparable cases that would require agent disambiguation. Therefore, a broader analysis of narrative data, 
like folktales, is required to verify this particular function of the post-predicate PP. 
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> As,: [harus ` [tu-bada]r [nya]r luk kita no baukakan nan]- 
have.to iPL-tell 3 thatiPLINCL NEG can eat that 


"We must tell them that we can't eat that. 


Apparently, the speaker felt the need to realize the — given - agent of the embedded 
clause by use of the overt pronoun kita. Note that in this construction the entire clause 
is backgrounded. It is very likely that the occurrence of the preverbal pronoun is meant 
to convey a so-called secondary contrast, i.e. “We must tell them thatwe can't eat that 
(although you can)? So, once more, the entity is not just topical but implicitly contrastive. 

Furthermore, according to the rules spelled out in 82, clitics (in all persons) are ex- 
pected to appear instead of "zero-marked" transitive clauses, but this is clearly not al- 
ways the case in our conversation. There are several instances of transitive clauses in 


which neither a clitic nor an overt agent occurs, like (57). 


(57) Qo: {What kind of chili did the speaker see how often?) 
> Qo 1: (What kind of chili did the speaker see rarely?) 


> Agi: [[jarang]cr gita cabe [mira]r]- 
rarely see chili red 


‘T rarely saw red chili? 


There are also a few preverbal third person A arguments, like (58), which all seem to 
mark a contrastive (or at least a non-continuous) topic. 


(58) Qı: {What did the people in various places make?) 
> Q 1: {What did the people at the factory make?) 


> Ay: [Jadi tau [analcr pina, [mara leng tau ` nana;]wa; pina 
then person overthere make like word person over.there make 
[peralatan untuk elepator]r]- 
tool parts elevator 


“At the time, people there, as they said, were making parts for an 
elevator. 


Finally, the P argument may occur as either pre-predicate argument or post-predicate 
argument. In all cases found, pre-predicate P arguments either correspond to the focused 
constituent (argument focus), as in examples (41) and (42), or to the contrastive topic, 
A374 / (47), while a post-predicate P argument either belongs to the background, (49) or 
(56), is included in a (wide or mid-size) focus spanning the entire VP constituent, (51), or 
represents the focus in a CT-F combination, as shown in (57) and (58). A zero marked P 
argument is, as one would expect, one that is always clear from the context, see Au in 
example (59). 


léThe current QUD framework does not consider the phenomenon of second-occurrence focus (Beaver & 
Velleman 2011; Büring 2015; Baumann 2016) and will, therefore, not mark any secondary foci within the 
background, since there is no nesting of focus domains. 
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(59 Qu: {How did we get there?) 


> An: [[olo]r [pang nan] 7]~ 
putin place that 


‘(They) put (us) in that place’ 


6 Summary and conclusions 


We have investigated conversational corpus data from Sumbawa and demonstrated how 
information structure, determined on the basis of contextual-pragmatic constraints and 
the concept of questions under discussion, is related to constituent-ordering patterns 
known from earlier morphosyntactic descriptions of the language. In order to present 
the findings in a more systematic way, we revisit Table 2 (now Table 4) and Table 3 (now 
Table 5) from 82 and include our new findings about information structure. 
Summarizing, we can say that, by default (i.e. in the case of broad focus), the argu- 
ments follow the verbal predicate. Arguments occur pre-predicatively whenever any of 
them is narrowly focused and there is no contrastive topic present. In addition to that, 
a contrastive topic may also occur pre-predicatively. We found several candidates for 
potential contrastive topics. Since in all the cases the contrast can only be construed 
implicitly, we acknowledge that some readers may prefer to use the terminology 'non- 
continuous topic' instead, although we see no essential difference between a contrastive 
topic (with or without an overt alternative) and the shift to a new topic, since from a 
pragmatic point of view, both require the formulation of a new QUD about this topi- 
cal entity. In either case there is no topical continuity. As we said, both focus and con- 
trastive/non-continuous topic may occur in the pre-predicate argument slot. This corre- 
spondence is very common cross-linguistically, since “the beginning of a sentence is a 


Table 4: Intransitive predicates: constituent order, subject realization and in- 
formation structure 


15t/2nd person 3'd person 
no overt [[V]F]~ 
subject cf. (52) (39, 55) 
y [[V S]r]- 
post-predicate (40,) 
S-NP * [[VTe [S]r]- 
(39, 45, 54) 
[S]e V]~ 
pre-predicate (not attested) 
S-NP [[S]cr [V]F]~ 
(53) (46) 
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Table 5: Transitive predicates: constituent order, agent realization and infor- 
mation structure 


15t/2°4 person 3'd person 
[[V]r]- 
(not attested) 
[[V P]p]~ 
no (51) 
overt [[P]e V]- 
agent (41, 42) 
[[P]cr [V]r]- 
cf. (47) 
[[V]r [P]r]- 
(56) 
post-predicate (not enough data) 
A-PP (ling) 
pre-predicate [[A]cr V [P]g]- 
A-NP (58) 


(no other patterns attested) 


highly prominent position" (Kroeger 2004: 139) and both cr and r share the property of 
evoking alternatives (Büring 2003). However, the two pragmatic roles are usually distin- 
guished phonologically in Sumbawa: a pre-predicate NP in focus occurs with a falling 
intonation, while the contrastive topic expression occurs with a rising intonation. 

We have demonstrated how the use of the QUD-tree framework can be successfully 
applied to conversational data from a lesser-studied language like Sumbawa. This is the 
first comprehensive application of this method to a non-European language. Not only 
were we able to provide an in-depth analysis of the information-structure system of the 
language but we could also show that the method is generally a valuable instrument 
to explain some of the constituent-order variation and the occurrence of clitics. With 
this method, we gain access to the information structure phenomena and corresponding 
syntactic properties contained in various kinds of text and dialogue data. This means 
that we are now able to investigate more potential variation on more natural data than 
when studying information structure only in elicitation experiments. Some ofthe variety 
may be explained by semantic-pragmatic factors, while others may reflect an ongoing 
grammatical change in Sumbawa. 

We are confident that the phenomena and variety discovered in this study are the 
beginning of a series of new discoveries in the field of pragmatic data analysis on larger 
and more varied natural corpora. 


T Compare also Latrouite & Riester (2018 [this volume]). 
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- focus domain NEG negation 

CT contrastive topic PL plural 

EXCL exclusive PST past tense 

F focus Q question particle 
HON honorific QUD question under discussion 
IMP imperative REL relativizer 

INCL inclusive SG singular 

ITJ interjection T aboutness topic 
NAI non-at-issue 
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Preposed NPs in Seediq 
Naomi Tsukida 


Aichi Prefectural University 


Seediq is an Austronesian language spoken in northeastern Taiwan. Its word order is VXS 
in general (where X stands for adjuncts or arguments other than the subject), but an NP that 
has some semantic relation to the matrix clause can precede the matrix clause. This NP is 
followed by the particle ’u and a non-final pause. I will call such NPs preposed NPs. This pa- 
per will investigate the nature of these preposed NPs. What is their semantics and function? 
What type of NPs can be preposed? What are their anaphoricity and topic-persistence prop- 
erties? Preposed NPs are often coreferential with the matrix subject, which may or may not 
be overt. When a preposed NP is coreferential with the main clause subject, and the matrix 
subject is not overt, then the seeming word order is SVX. When the preposed NP is coref- 
erential with the matrix subject, and that subject is overt, then the seeming word order is 
SVXS. How do SVX and SVXS differ? What is the function of this double reference in SVXS? 
In addition, how do these word orders differ from simple VXS order? 


1 Introduction 


Seediq is an Austronesian language, spoken in northeastern Taiwan and belonging to 
the Atayalic subgroup. There are three Seediq dialects: Teruku, Tekedaya, and Tuuda. 
The research reported here is based on the Teruku dialect, which is mainly spoken in 
Hualien County. The population of the Teruku subgroup is about 30,000, but the younger 
generations do not speak the language. 

In what follows 81.1 explains Seediq word order. Subject and voice are covered in 81.2, 
non-subject arguments in $1.3, and the ’u particle in 81.4. 


11 Word order 


The basic constituent order in Seediq is VXS (X stands for adjuncts or arguments other 
than the subject). In example (1), k<em>erut! ‘<av>cut’ is the V, bunga ‘sweet potato’ is 
a non-subject argument, and payi old woman' is the subject. Ka is multi-functional. Its 
function here is to mark the subject. 


Wf phoneme inventory of Teruku Seediq is as follows: p, t, k, q, °’, b, d, s, x, h, g (voiced velar fricative), c, 
1 (voiced lateral fricative), r, m, n, ng (velar nasal), w, y, a, i, u, and e (schwa). 


Naomi Tsukida. 2018. Preposed NPs in Seediq. In Sonja Riesberg, Asako Shiohara & 
| Atsuko Utsumi (eds.), Perspectives on information structure in Austronesian languages, 
313-344. Berlin: Language Science Press. Z€ : 
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(1) AV 
K<em>erut bunga ka payi. 
<Av>cut  sweet.potato sBJ old.woman 


"Ihe old woman cuts sweet potato: 


In NPs, modifiers other than numbers typically follow the head. See $3.1 for details. 

Seediq has two sets of clitic pronouns, NOMINATIVE and GENITIVE, and two sets of 
independent pronouns, NEUTRAL and OBLIQUE. While all the pronouns have a GENITIVE 
clitic, the NOMINATIVE clitic is limited to the first and second persons. Clitic pronouns are 
second-position clitics, following the first element ofthe predicate. Some of the examples 
are -ku in example (2) and =na in examples (6), (7), and (8). 


(2) Clitic pronoun 
K<em>erut=ku bunga (ka yaku). 
<AV>Cut=1SG.NOM sweet.potato SBJ 1SG 


‘I cut sweet potato: 


One can omit adjuncts and arguments including the subject if they are recoverable from 
the context. Clitic pronouns are enough to indicate arguments, so when there is a nomi- 
native clitic in the sentence, it does not need to be referred to by an independent pronoun. 

In addition to the basic VXS order, Seediq can have an NP preceding the matrix clause, 
followed by the particle 'u and a non-final pause. In example (3), payi ‘old woman’ is 
preposed and followed by the ’u particle and a non-final pause. The matrix subject ka 
payi may be omitted, because, even if it is omitted, it is identifiable as coreferential with 
the preposed NP. 


(3) Preposed NP 
Payi "u k<em>erut bunga (ka hiya). 
old.woman PRT <Av>cut  sweet.potato sBJ 3SG 


"Ihe old woman, she cuts sweet potato: 


As a result, SVX order (more precisely, S 'u, V X (ka S)) is realized, though I do not regard 
the preposed NP as a subject. This paper will investigate the nature of such preposed NPs. 


1.2 Subject and voice 


The subject appears in the Neutral case, preceded by the particle ka, in clause-final po- 
sition. Ka is multi-functional; some of its functions are subject-marker, linker, and com- 
plementizer. Its usage as a linker is yet to be investigated. 

In Seediq, thematic roles fall into three groups, according to the verb form they trig- 
ger when they are chosen as the subject. The first group, the A group, includes AGENT, 
THEME, and EXPERIENCER; these trigger AV forms, as in (1) in 81.1. The second group, the 
G group, includes PATIENT, GOAL, LOCATION, and RECIPIENT; these trigger GV forms, 
as in (4). The third group, the C group, includes CoNvEvED THEME, INSTRUMENT, and 
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BENEFICIARY; these trigger CV forms, as in (5). In (4) and (5), the patient bunga 'sweet 
potato’ and the instrument yayu ‘knife’ is the subject, respectively, and the Agent payi 
‘old woman’ is not the subject anymore. 


(4) GV 
Kerut-un payi ka bunga. 
cut-cvi old.woman sBJ sweet.potato 


"Ihe/An old woman will cut the sweet potato’ 


(5 CV 
Se-kerut bunga payi ka yayu. 
cv-cut sweet.potato old. woman spy knife 


"Ihe/An old woman cut the/a sweet potato with the knife’ 


Patient and location are grouped together; forms suffixed with -un are GV1, and those 
with -an are GV2 forms. Corresponding forms in other languages are often regarded 
as a patient voice and a location voice, respectively (Huang & Hsinsheng 2016: 78, for 
example), but in Teruku-Seediq, their usage is not so straightforward. Even when the 
thematic role of the subject is location, if one wants to express a future event, the -un 
form is used, for example. Moreover, even when the thematic role of the subject is patient, 
the -an form may be used if one wants to express a progressive or habitual event. See 
Tsukida (2012) for more details. 

When the subject is a pronoun, it triggers a nominative clitic pronoun after the first 
element of the predicate (see example (2)). 

There are cases where the thematic role of a ka-marked NP and the verb form do not 
correspond. For example, the verb is in AV form but the thematic role of the ka-marked 
NP is patient. In such cases the ka-marked NP cannot be regarded as the subject. Such 
cases exist, though they are rare. 

In addition to triggering the nominative clitic pronoun, subjects show several distinc- 
tive morpho-syntactic properties (see Tsukida 2009 for details). 


13 Non-subject arguments 


An NP belonging to the A group (the group of NPs with thematic roles that trigger 
AV) is realized by the genitive clitic if it is a pronoun, when it is not the subject. The 
A arguments in (4) and (5) are not subjects. If one replaces them with pronouns, they 
would be as (6) and (7), respectively. 


(6) GV, with pronominal A 
Kerut-un-na ka bunga. 
cut-GV1=3SG.GEN SBJ sweet.potato 


‘He/She will cut the sweet potato: 
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(7) CV, with pronominal A 
Se-kerut-na bunga ka yayu. 
CV-cut-3SG.GEN sweet.potato sBJ knife 


"Ihe/An old woman cut the/a sweet potato with the knife’ 


As for nouns, they do not have a distinct genitive form; the genitive form is the same 
as the neutral form. We can regard payi 'old woman' in (4) and (5) as a genitive form, 
though it is formally the same as a neutral form. 

When it is not the subject, an NP belonging to the G or C group (the group of NPs with 
thematic roles that trigger GV or CV) is realized as OBLIQUE. The oblique form, however, 
is not distinct from the neutral form, except for pronouns and nouns with high animacy. 
For pronouns and NPs with high animacy, the oblique form usually involves the suffix 
-an. The occurrences of bunga 'sweet potato' in (1), (2), (3), (5) and (7) are regarded as 
oblique forms, though they are homophonous with neutral forms. Sediq-an 'person-oBr' 
in (8) is an example of a distinct oblique form. 


(8) CV 
Se-kerut-na sediq-an ka yayu niyi. 
CV-cut-3SG.GEN person-OBL SBJ knife PROX 
‘He/She cuts the/a person with this knife: 


14 The ’u particle 


As mentioned in $1.1, an NP may appear preceding a matrix clause, followed by the 
particle `u and a non-final pause. The particle may be ga, deu, or dega, in addition to v. 
Ga is interchangeable with “v. I will use only ’u in the remainder of the paper. De'u and 
dega are interchangeable, as they are derived from ’u and ga by affixation of de-. De'u 
and dega are used differently from ’u and ga. I will not treat the use of deu and dega in 
this paper. 

What can precede ’u is not only an NP but also a clause. In example (9), two clauses 
are connected by “u. 


(9) Clause-A ’u, clause-B. 
M-iyah-su hini "u, me-qaras-ku. 
AV-come=2SG.NOM here PRT Av-be.glad=1sG.NOM 


“If you come, I will be glad: 


’u is multifunctional, and the two clauses connected by it may have several kinds of 
relationships. In this sentence, the preposed clause is a conditional for the event denoted 
by the matrix clause. One can add an appropriate adverb to express the relationship 
overtly. In example (10), for example, nasi ‘if’ is added to express the conditional meaning 
overtly. 
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(10) With nasi ‘if? 
Nasi-su | m-iyah hini wu me-qaras=ku. 
if-2sG.NOM Av-come here PRT Av-be.glad-1sc.NoM 


‘If you come here, I will be glad? 


15 Texts 


In the next section, I will investigate the nature of the preposed NPs mentioned in 81.1 
from the point of view of semantics/function, NP types, anaphoricity, and topic persis- 
tence, analyzing Seediq texts. 

For my analysis, I used parts of Sufferings of the Teruku church written by Yudaw 
Pisaw (Tien Shin-de). In these texts, matrix clause subjects occur five times more fre- 
quently than preposed NPs. I started checking from the beginning and when I reached 
3728 words, 262 matrix clause subjects and 55 preposed NPs had been found. I therefore 
stopped counting matrix clause subjects but continued counting preposed NPs, up to 
7315 words, and found 38 more instances of preposed NPs. In total, I have 93 instances 
of preposed NPs. 


Table 1: Matrix subjects and preposed NPs in texts 


Words Matrix subject Preposed NP 
1-3728 262 55 
3729-7315 — (stopped counting) 38 
Total 262 93 


2 Semantics/function of preposed NPs 


It is not the case that one can freely choose an NP to precede the matrix clause. Preposing 
is usually restricted to the following NPs: 


e An NP that is coreferential with the matrix subject (preposed subject) 


e An NP that is coreferential with the A-argument in the matrix clause (preposed 
A) 


e An NP that expresses the place of existence in an existential construction, or the 
possessor in a possessive construction, or a possessor that is left-dislocated from 
the matrix subject (preposed possessor, see 82.3 for details) 


e An NP that expresses time (preposed time) 
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* Choices in alternative questions are preposed even when they are not subjects, 
A-arguments, times, places, or possessors (preposed alternatives) 


Some preposed NPs apparently do not fit the above criteria. They are not coreferential 
with a subject, an A argument, or a possessor in the matrix clause; nor are they frames 
or alternatives. These will be discussed in $2.6. 

Among 93 instances of preposed NPs, 66 instances have a preposed NP that is corefer- 
ential with the subject of the clause (preposed subject); in four instances it is coreferential 
with an A-argument (preposed A), in one instance it is a possessor (preposed possessor), 
and in 16 instances it is time (preposed time). There were six instances where it was diffi- 
cult to judge the exact function of the preposed NP. These 93 instances are summarized 
in Table 2. 


Table 2: Semantics/function of preposed NP 


Token % 

Preposed Subject 66 70% 
Preposed A 4 676 
Preposed Possessor 1 1% 
Preposed Time 16 17% 
Preposed alternative 0 0% 
None of the above 6 6% 
Total 93 100% 


I will explain each of the above cases in turn. 


2.1 Preposed subjects 


One can prepose an NP that is coreferential with the matrix subject, as in (3) and (11). I 
will call such cases PREPOSED SUBJECTS, though preposed NPs are not subjects, actually. 


(11) Preposed subject 
Niyi; u, "adi ’utux=ta (ka kiya;). 
PROX PRT NEG God=1PLIN.GEN SBJ it 


“As for this, it is not our God? 


One can omit the matrix subject, as shown in (11). 

When an NP that is coreferential with the subject appears in pre-clausal position, the 
subject in the regular clause-final position is often omitted. In the text, 13 of 66 preposed 
subjects had overt subjects, and 53 instances had covert subjects. I will investigate in 
§3.3, §4.3, and §5.3 whether there is any difference between these two cases. 

With a verbal predicate, “subject” means the NP that triggers the verb form, appearing 
in clause-final position preceded by the subject particle ka. Payi ‘old woman’ in (1), bunga 
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‘sweet potato’ in (4), and yayu ‘knife’ in (5) are examples of subjects. For sentence (1), one 
can put payi 'old woman' in front of the matrix clause, as in (12), but one cannot do the 
same for bunga 'sweet potato; as in (13). 


(12) One can prepose a subject argument 
Payi "u k<em>erut bunga (ka hiya). 
old.woman PRT <Av>cut  sweet.potato SBJ 3SG 


"Ihe old woman, she cuts sweet potato: 


(13) One cannot prepose a non-subject argument 
*Bunga èu, k<em>erut ka payi. 
sweet.potato PRT <Av>cut sBJ old.woman 


"Ihe old woman cuts sweet potato: 


For sentence (4), one can prepose bunga ‘sweet potato’, as in (14). 


(14) GV 
Bunga ’u__kerut-un payi (ka bunga). 
sweet.potato PRT cut-Gv1 old.woman sBJ sweet.potato 


"Ihe/An old woman will cut the sweet potato’ 


A preposed subject is not an actual subject, as I have indicated, but only coreferential 
with the matrix subject. It seems to share some of the subject properties mentioned in 
§1.2, however. A nominative clitic that corresponds to the preposed NP appears in the 
main clause, for example. 


(15) One can prepose a subject argument 
Yaku ’u, k<em>erut=ku bunga (ka yaku). 
1SG PRT <AV>cut=1SG.NOM sweet.potato SBJ 1SG 


‘As for me, I cut sweet potato’ 


We cannot tell, however, whether it is the preposed NP yaku or the subject of the matrix 
clause that triggers the clitic. 

When the preposed NP is coreferential with the matrix subject, the matrix subject may 
either appear again or be omitted. For 13 of 66 preposed subject, matrix clause subject is 
overt and for 53 it was not overt. When it is overt, it is realized either by hiya, the 3rd 
person singular pronoun, by dehiya, the 3rd person plural pronoun, or by kiya ‘it, so’. In 
example (16), Karaw Wacih (proper name) appears in preposed position and at the end 
of the matrix clause, it is referred to again by a pronoun ka hiya. 


(16) Preposed subject with overt matrix subject 
Siida ka Karaw Watih niyi 'u adi ka balay=bi senehiyi Yisu Kiristu 
then LNK Karaw Wach prox PRT NEG LNK really-really Av.believe Jesus Christ 
ka hiya niyana. 
SBJ 3SG yet 


‘At that time Karaw Wacih, he did not really believed in Gospel yet: 
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I will call this pattern S *u, VX ka hiya. In this denotation, hiya represents hiya, dehiya 
and kiya. The pattern where the matrix subject is not overt, on the other hand, will be 
labeled S 'u, VX. 


2.2 Preposed A 


Here by “A” I mean the group of NPs which are not the subject but whose thematic roles 
trigger AV when they become the subject. A pronoun is in the genitive case when it 
appears as non-subject, as in (6) and (7). 

An A argument can be preposed to the matrix clause. In a sentence like (5), for example, 
one can prepose payi 'old woman', as in (17), because its thematic role is agent. The 
genitive clitic pronoun =na, which is coreferential with payi appears in the matrix clause. 


(17) CV 
Payi "u se-kerut=na bunga ka yayu. 
old.woman PRT cv-cut-3sG.GEN sweet.potato sBJ knife 


‘As for the old woman, she will cut the/a sweet potato with the knife’ 


There are four such instances among the 95 preposed NPs. 

Preposing A is not allowed if there is no X (non-subject argument). In (18) there is only 
the A argument Kumu (a person’s name) as a non-subject argument. From this sentence, 
one cannot prepose A, as shown in (19). 


(18) A as non-subject argument 
B<en>arig Kumu ka patas niyi. 
<CV.PRF>buy Kumu sept book prox 


“Kumu bought this book’ 


(19) One cannot prepose A 
"kumm 'u, b<en>arig=na ka patas niyi. 
kumu PRT <CV.PRF>buy=3SG.GEN SBJ book PROX 


‘As for Kumu, she bought this book./As for Kumu, this book was bought by her’ 


A time expression is not enough to enable preposing of A. Even if sehiga ‘yesterday’ is 
X, one cannot prepose A, as shown in (20). 


(20) One cannot prepose A 
“Kumu 'u, b<en>arig=na sehiga ka patas niyi. 
Kumu PRT «cv.PRF-buy-3sG.GEN yesterday sBJ book this 


‘As for Kumu, she bought this book yesterday? 


2.3 Preposed possessor 


One of the morphosyntactic properties of subjects is that they license possessor left- 
dislocation. From a subject NP consisting of a noun and a possessor, one can move out 
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the possessor and put it in front of the clause. Examples (21) and (22) exemplify this. In 
(21) Masaw (the name of a male person) is part of the NP tederuy Masaw 'Masaw's car’, 
which is the subject of the sentence. In (22), on the other hand, Masaw is preposed, and in 
the subject a genitive clitic pronoun is present which is coreferential with the preposed 
Masaw. 


(21  Me-gerung ka tederuy Masaw. 
Av-be.broken sBJ car Masaw 


*Masaw's car is broken? 


(22) Masaw 'u, me-gerung ka tederuy-na. 
Masaw PRT Av-be.broken sBJ car=3SG.GEN 


“As for Masaw, his car is broken? 


There is one such instance among the 93 preposed NPs. 


2.4 Preposed time 


Preposed time sets the temporal frame in which the following expression should be in- 
terpreted. An example is (23). 


(23) Preposed time 
Ya'asa diyan “u, m-iyah r<em>igaw s<em>bu ka gen sekiya 
because day:time PRT Av-come <Av>hang:about <av>hit spy ship sky 
"Amirika heki. 
America REASON 


‘Because in the daytime, American airplanes come to make an air raid? 


There are 16 such instances among the 93 preposed NPs. 


2.5 Preposed alternatives 


For alternative questions, alternatives are preposed, as in (24). 


(24) Preposed Alternatives 
Deha niyi “u, 'ima ka sewayi=su? 
two PROX PRT who sBJ younger:sibling-2sG.GEN 


"Between these two, who is your younger sibling?' 


There was no such example in the text I used in this investigation. 
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2.6 None of the above 


Some preposed NPs apparently do not belong to any of the classifications listed above. 
Such NPs rarely become subjects, but examples do exist. In (25), saw niyi ‘such things’ 
is a patient, not A, of the verb me-kela ‘to know’, notionally. It is not a subject, either, 
because the predicate verb is AV, not GV, which would have a patient as its subject. Nor 
is it a possessor, time or expression of alternatives. 


(25) None ofthe above 
Saw niyi "wu dima me-kela ka nihung. 
like PROX PRT already Av-know sBj Japan 


“As for such things, Japan already knew? 


There were six such instances in the text I analyzed. 


2.7 Discussion 


Let us examine here whether these preposed NPs are "topics" of some sort. 

Preposed subjects, preposed As, and preposed possessors, shown in $2.1, 82.2 and 82.3, 
seem to serve as aboutness topics (Krifka 2007: 40-41). They indicate what the informa- 
tion denoted by the matrix clause (-comment) is about. Examples (12) and (17), for ex- 
ample, are about payi ‘old woman’. The matrix clause tells the information about the 
preposed NP. 

Preposed time and preposed alternative, on the other hand, seem to serve as frame 
setters. The frame setter indicates that the information actually provided is restricted 
to the particular dimension specified (Krifka 2007: 47). In example (23), the information 
provided by the matrix clause ‘American airplanes come to make an air raid’ is restricted 
to the particular dimension specified by the preposed NP diyan ‘daytime’, for example. 

For those elements that are neither preposed subjects, preposed As, preposed posses- 
sors, preposed time or preposed alternatives, still more research is necessary, but as for 
example (25), the preposed NP, saw niyi euch things' seems to serve to set frame for the 
information provided by the matrix clause, ‘Japan already knew’. 


3 NP types 


In this section, I will investigate the composition of preposed NPs. 


3.1 Preliminaries 


In Seediq NPs, a noun (a common noun or a proper noun) or a pronoun usually functions 
as the head, but demonstratives and VPs can also function as referential expressions, 
without any affix or particle. Seediq is quite flexible, in the sense of van Lier & Rijkhoff 
(2013). 
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(26) A noun as a referential expression 
Q«em» ita-ku kuyuh ka yaku. 
<AV>see=1SG.NOM woman SBJ 1SG 


‘I saw a/the woman: 


(27) A demonstrative as a referential expression 
Me-gerung ka gaga. 
Av-be:broken SBJ DIST 
“That is broken? 


VPs also are used as referential expressions without any additional affixation or particle. 
AV forms mean ‘one who does the action denoted by the verb’/‘one who bears the state 
denoted by the verb’; GV forms mean ‘the object to which the action denoted by the 
verb is done’/‘the place where the action denoted by the verb takes place’/‘the recipient 
who recieves something’/‘the goal toward which the action denoted by the verb aims’/ 
etc.; CV forms mean ‘the instrument by which the action is done’/‘the beneficiary for 
whom the action is done’/‘the object that is transferred'/etc" A CV-perfect form may 
mean ‘what has been done as the action denoted by the verb’ (see (48)). Mpe-tegesa kari 
‘the one who teaches language’ in (28) is an example of an AV form used as a referential 
expression. 


(28) A VP as a referential expression 
Q«em» ita-ku mpe-tegesa kari ka yaku. 
<AV>See=1SG.NOM AV.FUT-teach language sBJ 1G 


‘I saw the language teacher. 


When the head is a noun, it can be modified by demonstratives, numbers, genitive pro- 
nouns, another noun, or relative clauses. Modifiers, except for numbers, typically follow 


the head. 


(29) A demonstrative modifying a noun 
Q<em>ita=ku kuyuh gagaka yaku. 
<AV>see=1SG.NOM woman DIST SBJ 1SG 


‘I saw that woman: 


(30) A number modifying a noun 
Me-gerung ka teru tederuy. 
Av-be:broken ssy three car 


"Ihe three cars are broken: 
(31) A genitive pronoun modifying a noun 


Q«em» ita-ku kuyuh-na ka yaku. 
<AV>Se€e=1SG.NOM Wife=3SG.GEN SBJ 1SG 


‘I saw his wife’ 
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(32) 


(33) 


Another noun modifying a noun 
Me-gerung ka sapah qehuni. 
AV-be:broken spy house wood 


“The wooden house is broken? 


A relative clause modifying a noun 
Q<em>ita=ku kuyuh mpe-tegesa kari ka yaku. 
<AV>see=1SG.NOM woman AV.FUT-teach language sBj 18G 


‘I saw the woman who will teach language’ 


I classified NPs into the following types, and investigated how they appear in preposed 


position. I also investigated their distribution as subjects and made comparisons. 


Personal pronoun (PP) 

Demonstrative (D) 

Proper noun + niyi (this) (PD) 

Common noun + niyi (this) (CD) 

Bare proper noun (BPN) 

Common noun + Genitive (CG) 

Common noun + Relative clause/Attributive (CR) 
Common noun (CN) 

Saw nominal (Saw) “Things like ..? 


VP 


This classification and ordering are based in part on Gundel et als (1993) classification 
which claims that the form of a given NP reflects the givenness of the NP and show the 
Givenness Hierarchy (Gundel et al. 1993: 275). 


uniquely type 
in focus > activated > familiar > identifiable > referential » identifiable 
that 
{it} this {that N} {the N} {indefinite this N} {a N} 
this N 


Figure 1: Givenness hierarchy (Gundel et al. 1993: 275) 


In Seediq, too, a personal pronoun (PP) may reflect in focus status, which is at the left- 
most position in the hierarchy, demonstrative (D) may reflect activated status, common 


noun + demonstrative (CD) may reflect activated or referential status, and so on. As 
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Seediq does not have a definite article or an indefinite article, it is not easy to identify 
each NP as uniquely identifiable or type identifiable 

At this stage, it is an open question whether and how the different composition in 
Seediq NPs reflects givenness. This is still under investigation, and the classification 
above is only a preliminary step. Though still to be examined more thoroughly, I tried 
ordering them so that it would correspond, at least partially, to the givenness hierarchy 
by Gundel et al. (1993). 

I will explain my classification and ordering below. 

First is an NP consisting of a personal pronoun. In Gundel et al. (1993), this is an 
indication of in focus status. An example of a personal pronoun preposed to a matrix 
clause: 


(34) Personal pronoun (PP) 
Hiya u, ya'a-bi teru 'idas ka ne-niq-un-na hiya ni 
3SG PRT UNCERTN-really three month spy FUT-live-Gv1=38G.GEN there and 


“Then she, it was about three months that she would live there, and...” 


Next is an NP consisting of a demonstrative. In Gundel et al. (1993), this is an indication 
of activated status. An example of a demonstrative preposed to a matrix clause: 


(35) Demonstrative (D) 
Niyi u, “adi ’utux=ta ka niyi. 
PROX PRT NEG God=1PLGEN SBJ PROX 
“This, this is not our God? 


I also treated the time expression si'ida ‘then, at that time’ as demonstrative. 


(36) Demonstrative (D) 
Siida 'u, me-seseli ngangut sapah `Umih Yadu. 
then PRTVAv-gather outside house PN PN 


‘At that time, they gathered outside of the house of Umih Yadu’ 


Next is an NP consisting of a proper name and a demonstrative (PD). Gundel et al. (1993) 
do not include such a form. An example of a proper noun modified by a demonstrative 
appearing in front of the matrix clause: 


(37) Proper noun + niyi (PD) 
Ya'asa Tiwang niyi “u, m-en-niq  degiyaq qawgan. 
because Ciwang PROX PRT AV-PRF-live mountain Qawgan 


‘Because this Ciwang, she was a Teruku, and she had lived at Mt. Qawgan: 


Proper nouns modified by demonstratives seem to be activated or familiar. 

Next is an NP consisting of a common noun and a demonstrative. Gundel et al. (1993) 
point out the functional difference between this and that from the point of view of infor- 
mation structure. According to them, this N is an indication of activated status, but that 
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N indicates familiarity. I examined the corpus to determine whether the corresponding 
expressions in Seediq, N niyi (this N) and N gaga (that N), show a similar difference, but 
I found only instances of N niyi. There are no instances of N gaga in the text, but N gaga 
is often found in elicitation. An example of a preposed CD (common noun modified by 
a demonstrative): 


(38) Common noun + niyi (CD) 
Budi | pe-patas-mu niyi 'u, biq-i-ku haya 
bamboo rUT-CV.write-1SG.GEN PROX PRT give-GV.NFIN-1SG.NOM BEN 
kuyuh=mu ` rubiq wilang ha. 
wife=1SG.GEN PN PN GENTLE 


‘As for this fountain pen, please give it to my wife Rubiq Wilang: 


Common nouns modified by demonstratives (CDs) are supposed to be activated, familiar 
or uniquely identifiable. 
Next is an NP consisting of a proper name. An example: 


(39) Bare proper noun (BPN) 
Kiya den, nihung "nu hengkawas 19,14 m-adas | hebaraw-bi merata ni  kensat 
SO PRT Japan PRT year 1914 Av-bring many-really soldier and police 
m-iyah k<em>eremux dexegal Teruku. 
Av-come <Av>invade land ` Teruku 


"Ihen, Japan, in 1914, brought soldiers and police and came to invade the Truku 
territory: 


Things or persons denoted by proper names are at least uniquely identifiable. 
Next is an NP consisting of a common noun and a genitive pronoun. The genitive 
expresses a possessor, as illustrated in (40). 


(40) Common noun + Genitive pronoun (CG) 
Bubu-na "u wada  m-arig bawa da. 
mother-3sG.GEN PRT be:gone Av-buy bun Ns 


"His/Her mother went to buy a bun’ 


Common nouns modified by genitive pronouns (CGs) are supposed to be uniquely iden- 
tifiable or referential. 

Next is an NP consisting of a common noun and a relative clause or attributive ex- 
pression. In Seediq, a noun may be modified by another noun, and the relationship be- 
tween them is possessor-possessed, material-thing (e.g., paper napkin), purpose-thing 
(e.g., traveling shoes), whole-part (e.g., face tattoo), product-producer (e.g., his book, 
meaning ‘the book he wrote’), etc. I collectively call those meanings that are expressed 
by the modifiers illustrated "attributive". “Relative clause" is actually a VP modifying a 
noun (see (42)). This type, therefore, is a class of NPs consisting of a noun and another 
noun or a VP modifying it. Examples are: 
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(41) Common noun + Other noun (CR) 
Kana ka | mensewayi de-senehiyi "uri "u, tetegeli'ing m-usa me-seseli 
all LNK brother  pi-believer also PRT Av.hide ^ Av-go Av-gather 
tehemuku. 
Av.worship 


"Ihe brothers and sisters of believers went together hiding to worship as well? 


(42) Common noun + VP (CR) 
Kensat nihung m-eniq Sekadang 'u, Matsudo-sang ni Motoyoshi-singsi 2 
police Japan  Av-live Sekadang prt Matsudo-Mr. and Motoyoshi-teacher two 
hiyi. 
body 
"Ihe Japanese police who were at Skadang, they were Mr. Matsudo and Teacher 
Motoyoshi, two people: 


Common nouns modified by another noun or VP (CRs) are also supposed to be uniquely 
identifiable, referential or type identifiable. 
Next is an NP consisting of a common noun alone, as illustrated in (43): 


(43) Common noun (CN) 
Ke'man 'u, pengkeku'ung sengkekingal m-usa 'ayug daya 'alang-deha 
night PRT in:the:darkness one:by:one Av-go stream upwards village=3PL.GEN 
ka sediq senehiyi — Kiristu. 
SBJ person Av-believe Christ 
“At night, people who believed in Christ went to the upper stream of their village, 
in the darkness, one by one: 


Bare common nouns may be uniquely identifiable, referential, ox type identifiable. Seediq 
does not have any definite article or indefinite article, so it is often hard to tell. 

Next is saw + noun. An NP is preceded by saw, a preposition meaning ‘like ..?; saw NP 
means 'things like an NP' or 'things concerning NP? Sometimes ka appears between saw 
and NP. An example: 


(44) Saw nominals 
Saw ka qaya samat "u des-un-deha be'nux, —seberig-an 
like LNK thing wild:animal PRT bring-Gv1=3PL.GEN plain:land sell-cv2 
kelemukan. 
Taiwanese 


“As for things concerning wild animals (animal skins, bones, horns, and the like), 
they brought them to the plains and sold them to Taiwanese: 


Saw * noun would correspond to type identifiable in givenness hierarchy by Gundel et al. 
(1993). 
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The last is VP. A VP can function as a referential expression, as in example (28). This 
type of expression may appear as the subject or as a non-subject argument, but rarely 
as a preposed element. (45) is another example of a VP functioning as a subject phrase. 
In this example, a VP n-arig=mu sehiga ‘<cv.PRF>buy=1SG.GEN yesterday’ means ‘what 
I bought yesterday: 


(45) VP as matrix subject 
Me-gerung ka n-arig=mu sehiga. 
AV-be:broken sBJ cv.PRF-buy=1sG.GEN yesterday 
"What I bought yesterday is broken’ 


Such VPs used as referential expression may be uniquely identifiable, referential or type 


identifiable, depending on the adjuncts contained in them. 


3.2 Results 


I investigated what type of NP appears for each type of preposed NP classified in $2. The 
results are shown in Table 3. 


Table 3: Types of preposed NPs, in terms of coreference 


Preposed Others Sum 
Subject Time A Po 

PP 2 0 0 0 0 2 
D 6 10 0 0 2 18 
PD 12 0 0 0 0 12 
CD 4 0 0 0 0 4 
BPN 20 0 2 0 0 22 
CG 6 0 1 1 0 8 
CR 4 0 0 0 1 5 
CN 4 6 1 0 0 11 
Saw+ noun T 0 0 0 3 12 
VP 1 0 0 0 0 1 
Interrogative 0 0 0 0 0 0 
Total 66 16 4 1 6 


There are several things I can point out from the results above. 

Personal pronouns rarely appear preposed. There are only two instances of them. 

For preposed subjects, person names, accompanied by a demonstrative (PD) or not 
(BPN), appear more often than the other types of NPs. 

For preposed time, there are 10 instances of si'ida ‘then, at that time. These are classi- 
fied as demonstrative. The other six instances are bare common nouns (CN). 
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Among those six instances that were not S, A, Po, or Time, three are of the type saw 
+ noun ‘things concerning noun. It seems easier for speakers to ignore the semantic 
relationship between the preposed NP and the voice of the predicate verb of the matrix 
clause when the preposed NP is saw + noun. Let us examine here the three cases more 
precisely. One is example (25). The other two are (46) and (47). 


(46) Saw + noun 
Saw niyi "uw berah 'ini 'iyah dexegal Taywan hini ka Nihung 
like PROX PRT before NEG AV.NFIN.come land Taiwan here spy Japan 
han. 
temporarily 


“As for such things, it was before Japan came here to the land of Taiwan? 


(47) Saw + noun 
Saw niyi "wu 'adi-nami t<em>egesaka yami kensat Nihung ‘uri pini, 
like PROX PRT NEG=1PLEX <AV>teach sBj 1PLEX police Japan also ANGER 
mesa. 
HEARSAY 


“As for such things, we Japanese police do not teach, they say? 


In these examples, saw niyi ‘such things’ seems to indicate rather abstract situations, not 
concrete entities. Such lack of concreteness seems to lead to inconsistency of the voice 
form of the matrix verb. 

Below, I will investigate whether the situation differs for preposed subjects if the ma- 
trix subject is overt or not. I will also compare the preposed subject situation with the 
situation of the matrix subject. 


3.3 Comparison 


Among sentences with preposed subjects, some have an overt matrix subject, and some 
do not. When the matrix subject is overt, the same referent is referred to twice, both in 
the topic position and in the regular subject position (S 'u, VX ka S). What is the function 
of this double reference? 

The preposed subject is coreferential with the matrix subject. How does the appear- 
ance of an NP in preposed position differ from an NP that is the matrix subject, in internal 
constituency? I will compare the NPs in these cases. 


3.3.1 When the matrix subject is overt and when it is not 


Among 66 instances of preposed subjects, 13 instances are S 'u, XV ka hiya pattern (with 
overt matrix clause subject), and 53 instances are S 'u, VX pattern (without overt matrix 
subject). The types of preposed NPs are shown in Table 4. 

There are considerable differences in internal constituency between S 'u, VX ka hiya 
pattern and S 'u, VX pattern. 
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Table 4: NP types of preposed subjects with overt matrix subjects and those 
without. 


Preposed subjects 
S'u,VXkahiya. S'u, VX. 


PP 0 0% 2 4% 
D 1 8% 5 9% 
PD 4 31% 8 15% 
CD 1 8% 3 6% 
BPN 6 46% 14 26% 
CG 0 0% 6 11% 
CR 0 0% 4 8% 
CN 0 0% 4 8% 
Saw+ noun 1 8% 6 12% 
VP 0 0% 1 2% 


Total 13 100% 53 100% 


We can first point out that what can appear as S of the S ’u, VX ka hiya pattern is very 
limited. Most of them (10 of 13) are PD or BPN. There is one example each of D, CD and 
saw + noun. There are no examples of PP, CR, CG, CN or VP. 

As for the PP (personal pronoun) type, for all preposed subjects there are very few 
instances, as we saw in §3.2. The two that occur lack an overt subject. Clauses with a 
personal pronoun as preposed subject and a matrix clause with an overt subject seem to 
be avoided. 

Among the 13 instances of preposed subjects with an overt regular-position subject (S 
"u, VX ka hiya type), 10 (76%) are instances of PD or BPN; that is, a proper name, with or 
without a modifying demonstrative, is introduced as the preposed NP and then referred 
to again in the matrix subject position by a pronoun, as in (48). 


(48) Tiwang niyi “u, Teruku ka hiya. 
Ciwang PROX PRT Teruku sBJ 3sG 


"Ihis Ciwang, she was a Teruku person: 


In the S ’u, VX pattern, on the other hand, PD and BPN represent only 41 percent of the 
cases. This is lower than the number of cases with the S ’u, VX ka hiya pattern (76%). 

As for the CR, CG, and CN types (common nouns modified by a VP or another noun, 
those modified by a genitive pronoun, and bare common nouns), there is no instance of 
the S ’u, VX ka hiya pattern. There is only one instance of the CD type (common noun 
modified by a demonstrative). We can thus say that common nouns rarely appear as 
preposed subjects of the S ’u, VX ka hiya pattern. 
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For the S *u, VX type, the proportion of common nouns as preposed NPs is higher (17 
among 53) than for the S 'u, VX ka hiya type. 

As for the saw + noun type, 6 of 7 instances of them occurred in the S ’u, VX type. 

We see in Table 4 that most instances of saw NP ’u (6 out of 7) occurred in the S ’u, 
VX pattern. 

There is one example of VP in preposed position (see the next section for this example). 


3.3.2 Preposed subjects and matrix subjects 


Now let us compare preposed subjects and matrix subjects. There are 262 overt matrix 
subject occurrences in the text. A comparison of NP types in preposed subjects and 
matrix subjects is given in Table 5 and visualized in Figure 2. 


Table 5: NP types of preposed subjects with and without overt regular-position 
subjects and matrix subjects. 


Preposed-Subject Matrix clause subjects 
S’u, VX ka hiya. S'u,VX. XV ka S 

PP 0 0% 2 4% 52 20% 
D 1 8% 5 9% 4 2% 
PD 4 31% 8 15% 10 4% 
CD 1 8% 3 6% 2 1% 
BPN 6 46% 14 26% 64 24% 
CG 0 0% 6 11% 49 19% 
CR 0 0% 4 8% 10 4% 
CN 0 0% 4 8% 48 18% 
Saw+noun 1 8% 6 12% 2 1% 
VP 0 0% 1 2% 21 8% 


Total 13 100% 53 100% 262 100% 


The PP type (personal pronouns) scarcely appears as a preposed subject, but for matrix 
subjects, the PP type is observed very often (20%). 

Demonstratives (D), proper names modified by demonstratives (PD), and common 
nouns modified by demonstratives (CD) appear as preposed subjects (22 out of 66, 33%), 
but not as often in regular subject position (16 out of 262, 6%). 

CD, CR, CG (common nouns modified by demonstratives, another noun, a VP, or a 
genitive pronoun) and CN (bare common nouns) are very common matrix subjects (109 
out of 262, 41%). For preposed subjects, they are much fewer (1 out of 14, 7%, for S of the 
Su VX ka hiya pattern and 17 out of 52, 32%, for the S of S ’u, VX pattern). 
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Figure 2: NP types of preposed subjects with and without overt regular- 
position subjects and matrix subjects. 


Personal names, bare (BPN) or with modification by demonstratives (PD), are less 
common for matrix subjects (74 out of 262, 28%) than for preposed subjects (10 out of 
13, 77%, for S of the S ’u, VX ka hiya pattern and 22 out of 53, 42%, for S of the S ’u, VX 
pattern). 

VPs can function as referential expressions, as we saw at the beginning of 83. Such ex- 
pressions appear in regular subject position, but rarely in preposed position. There were 
21 instances in which a VP appeared as a matrix subject, but only one of a prepositional 
phrase in pre-clausal position. Example (49) shows a VP functioning as subject. 


(49) VP as the subject 
Laqi-na "Ipay Yudaw ka q«em»«en» ita. 
child-3sc.GEN Ipay Yudaw SBJ <Av><PRF>see 


"Ihe one who saw it was her child Ipay Yudaw: 


We can say that NP ka VP is similar to a cleft or pseudocleft construction. Similar con- 
structions are reported for Philippine languages (Himmelmann 2005: 140, Nagaya 2011: 
604), and for many other Formosan languages as well. 

A preposed VP is usually interpreted as an adverbial-clause predicate with the subject 
omitted, and not as a referential phrase. This is because 'u is multifunctional and can 
mark adverbial clauses as well as preposed subjects, as we saw in 81.4. In the texts, there 
was one example of a preposed VP that could be interpreted as a referential expression 
(see example (50)). 
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(50) VP preceding the matrix clause 
P<en>le’alay=bi senehiyi kari Kiristuka 'alang Besuring wu Talug 
<AV.PRF>first=really Av.believe word Christ LNK village Besuring prt Talug 
Payan. 
Payab 
‘As for the first one to believe the gospel of Christ in the village of Besuring, it 
was Talug Payan? 


The VP preceding u in (50), p<en>le’alay=bi senehiyi kari Kiristu ka alang Besuring can- 
not be interpreted as an adverbial, meaning 'as/because/if one was the first to believe 
the gospel of Christ in the Besuring village’, but only as a referential phrase, meaning 
‘the first one to believe the gospel of Christ in the Besuring village’. 

Instances of saw + noun ‘such things as noun, such things related to noun’ are very 
few for matrix subjects (1 out of 262). We can see more examples in preposed position (1 
out of 13 for S of the S wu VX ka hiya pattern, 6 out 53 for S of the S ’u, VX pattern). This 
distribution seems to be the opposite of that with VPs. 


4 Anaphoricity 


In this section, I will investigate anaphoricity in preposed subjects. 


4.1 Preliminaries 


Anaphoricity is an index of the degree to which a referent can be said to have a discourse 
antecedent. Following Gregory & Michaelis (2001: 1687), I apply the label “anaphoricity” 
to an attribute with three possible values: 


e 0: Tokens containing pre-clausal NPs whose referents have not been mentioned 
in the preceding discourse. 


* 1: Tokens containing pre-clausal NPs whose referents are members of a set that 
was previously evoked. 


e 2: Tokens containing pre-clausal NPs that denote entities that have been men- 
tioned previously in the discourse. 


Examples (51)-(53) illustrate the three possible anaphoricity scores, with referring ex- 
pressions and their antecedents co-indexed in (52) and (53). 

(51) is an example of an anaphoricity score of 0. The preposed NP in this example is 
saw ka qaya samat “those things concerning wild animals. In the previous text, it is not 
mentioned at all, so its anaphoricity score is 0. 
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(51) Anaphoricity score of 0 (not mentioned before) 
Saw ka qaya samat "u des-un-deha be’nux, —seberig-an 
like LNK thing wild:animal PRT bring-Gv1=3PL.GEN plain:land sell-cv2 
kelemukan. 
Taiwanese 


“As for those things concerning wild animals (animal skins, bones, horns, and the 
like), they brought them to the plain and sold them to Taiwanese’ 


(52) has an anaphoricity score of 1. The preposed NP in example (52b) is duma 'some, 
others. It is a member of dehiyaan opt. opt. which was mentioned in (52a). 


(52) Anaphoricity score of 1 (members of a set that was previously evoked) 
a. Bitaq saw m-ahu quyu peqeraqil dehiya'an;. 
until like Av-wash snake Av.torture 3PL.OBL 


H 


“Ihey tortured them up to hitting snake (=expression of harshness) 


b. Duma; 'u pesa-un=deha kulu m-banah. 
some PRT put-GV1-3PL.GEN box av-red 
‘Some were put into prison (=red box)? 

(53) has an anaphoricity score of 2. The preposed NP in example (53b) is dexegal Taiwan 

‘land of Taiwan’ It is mentioned in the previous clause (53a). 


(53) Anaphoricity score of 2 (Mentioned before) 


a. Saw niyi u, berah 'ini "iyah dexegal Taywan; hini ka Nihung 
like this PRT before NEG AV.NFIN.come land Taiwan here spy Japan 
han. 
temporarily 


“Things like these, it was before Japan came here to Taiwan? 


b. Kiyani pa'ah hengkawas 1895 siida, dexegal Taywan; 'u  diy-un 
so  andfrom year 1895 then land Taiwan PRT have-cvi 
k<em>elawa Nihung. 

«AV»govern Japan 


"Ihen, from 1895, the land of Taiwan was owned and governed by Japan: 


4.2 Results 


For preposed subjects, 35 out of 66 had anaphoricity scores of 2, 1 out of 7 had a score of 
1, and 24 of 66 had scores of 0. The results are summarized in Table 6. 
In nearly half of the instances the score was 2. 
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Table 6: Anaphoricity of preposed subjects 


Preposed Subjects % 
0 (=no prior mention) 24 36% 
1 (=member of an activated set) 7 11% 
2 (=prior mention) 35 53% 
Total 66 100% 


4.3 Comparison 

In this section I will compare the anaphoricity scores of preposed NPs with and without 
overt matrix subjects and also with matrix subjects. 

4.3.1 When the matrix subject is overt and when it is not 


A comparison of the anaphoricity of preposed subjects with and without overt matrix 
subjects is shown in Table 7. 


Table 7: Anaphoricity of preposed subjects with and without overt matrix sub- 
jects 


Preposed subject 
S'u, VX ka hiya. S'u, VX. 


0 (=no prior mention) 7 54% 17 32% 
1 («member of an activated set) 0 0% 7 13% 
2 (=prior mention) 6 46% 29 55% 
Total 13 100% 53 100% 


For S of the S ’u, XV ka hiya pattern, the anaphoricity score is either 0 or 2; there is no 
instance of an anaphoricity score of 1. Seven out of 13 are examples where the score is 
0, the other 6 are examples where the score is 2. For S of the S 'u, XV ka hiya pattern, 
about a half of the instances (29 of 53) had scores of 2. We can say that the proportion 
of scores of 2 is similar when the matrix subject is present or absent. For scores of 0 and 
1, two cases show a difference: for S of the S 'u, XV pattern, the anaphoricity score is 0 
(17 out of 53) or 1 (7 out of 53), whereas there are no instances of anaphoricity scores of 
1 for S of the S ’u, XV ka hiya pattern. 


4.3.2 Preposed subjects and matrix subjects 


A comparison of the anaphoricity of preposed subjects and matrix subjects is given in 
Table 8, and illustrated in Figure 3. 
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Table 8: Anaphoricity of preposed subjects with and without overt matrix sub- 
jects, and matrix subjects. 


Preposed subjects Matrix subjects 
S'u, VX ka hiya. S'u, VX. VX ka S. 
0 (=no prior mention) 7 54% 17 32% 98 37% 
1 (=member of an activated set) 0 0% 7 13% 15 6% 
2 (=prior mention) 6 46% 29 55% 149 57% 
Total 13 100% 53 100% 262 100% 
60% 
50% 
409 
30% 
209 
10% 
0% 
no prior mention member of an activated prior mention 
set 
S'u, VX ka hiya. S'u, VX. VX ka 


Figure 3: Anaphoricity of preposed subjects with and without overt matrix 
subjects, and matrix subjects. 


For matrix subjects, the proportion of scores of 0, 1, or 2 is somewhat similar to the S of 
the S wu XV pattern. For both, more than half of the referents (57% for matrix subjects and 
55% for S of the S ’u, XV pattern) had anaphoricity scores of 2 (mentioned in the previous 
discourse), and about one third (37% for matrix subjects and 33% for preposed subjects 
without overt matrix subjects) had scores of 0 (not mentioned at all). The percentage of 
NPs with scores of 1 is somewhat lower for matrix subjects (6%) than for S of the S u, 
XV pattern (13%). 

For S of the S ’u, XV ka hiya pattern, the rate of each score differs from the other two 
types. There is no instance of score 1, and the percentage for score 0 is somewhat higher 
than for the other two. It may be a coincidence which originates from scarcity of the 
data. 
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4.4 Correlation between NP types and anaphoricity 


Let us see the correlation between NP types and anaphoricity. It is summarized in Table 9. 


Table 9: Type of NP and anaphoricity 


Anaphoricity score 


0 1 2 
PP 0 0 2 
D 0 0 6 
PD 1 0 11 
CD 1 0 3 
BPN 12 0 8 
CG 4 0 2 
CR 2 2 0 
CN 1 3 0 
Saw+ noun 2 2 3 
VP 1 0 0 
Total 24 7 35 


It is interesting to see the anaphoricity difference between PD and BPN. For most of the 
PD (11 of 12), the anaphoricity score is 2, while more than half of BPN (12 of 20) had a score 
of 0. When something already referred to is mentioned again, it appears accompanied by 
demonstratives. This seems to support the idea that PDs are activated or familiar while 
BPN are at least uniquely identifiable. For all PP and D also, the anaphoricity score is 
2. This seems to support the idea that PP and D correspond to in focus and activated, 
respectively. 

For those that would correspond to lower givenness (CR, CN, saw + noun or VP), we 
can see that score of 2 is not observed very much. 


5 Topic persistence 


Lastly, I will examine the topic persistence of preposed subjects. 


5.1 Preliminaries 


Iapplied the label of persistence to an attribute with four possible values in the following 
way: 


e 0: The pre-clausal NP denotatum is not referred to at all within five subsequent 
clauses. 
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I added a fourth value to the classification in Gregory & Michaelis (2001: 1689), because 
in Seediq one can omit NPs that are recoverable from the context. I wanted to distin- 
guish covert and overt pronouns, so I limited the score of 2 to those cases with overt 
pronouns and applied a score of 3 to those cases with covert pronouns. The examples 
in (54-57) illustrate the four possible persistence scores, with referring expressions and 


* 1: The pre-clausal NP denotatum is referred to in subsequent clauses by means of 
a lexically headed NP rather than a pronoun. 


* 2: The pre-clausal NP denotatum is expressed pronominally within the five follow- 
ing clauses. 


* 3: The pre-clausal NP denotatum is referred to in subsequent clauses by means of 
a zero pronoun. 


their antecedents co-indexed. 


The preposed NP of (54a), Tiwang niyi ‘this Ciwang; is not referred to at all in at 
least the following five clauses (though the example below only shows the following 
two clauses), so the score is 0. From sentence (54b), the topic of the text is changed to 


Bakan Hagay and Karaw Wacih. 


(54) 


The preposed NP of (55a), Lebak Yudaw ‘Lebak Yudaw, is referred to by a lexically headed 
NP Lebak niyi ‘this Lebak’ in sentence (55b), the following clause. The score is therefore 


1. 


(55) 
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Lack of persistence; score of 0 


a. Tiwang niyi 'u, dima sedu’uy kari Kiristu ka hiya da. 


Ciwang PROX PRT already Av.have words Christ sBJ 3sG NS 


‘This Ciwang, she was already a Christian’ 


. Pa'ah d«en»ehuq-an-na Ekedusan ka Bakan Hagay `u 


from <PRF>arive-GV=3SG.GEN Ekedusan sBJ Bakan Hagay PRT 

“Since Bakan Hagay arrived at Ekedusan; 

'ida s<em>ekuxul=bi m-uyas "uyas 'Utux Baraw deha Karaw Watih 
surely <Av>like=really Av-sing song God Heaven with Karaw Wach 
senaw-na. 

husband-3sG.GEN 

'she liked singing songs about God in heaven with her husband Karaw 
Wach 


Repeated NP; score of 1 
a. Si'idaka Lebak Yudawi 'u, me-'ayung sapah kensat ka hiya ni 


then INK Lebak Yudaw PRT Av-assistant house police s8J 3sG and 


“At that time, Lebak Yudaw was an assistant at the police station and; 
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b. m-bahang kari quri t«en-egesa Kiristu ka Lebak niyi; "uri, 
Av-listen story about <cv>PRF.teach Christ spy Lebak PROX also 


‘this Lebak also heard the story about what Christ taught’ 


The preposed NP in (56a), niyi ‘this, is referred to by the pronoun kiya ‘it, so’ in sentence 
(56b), the following sentence. The score is therefore 2. 


(56) Pronominal use; score of 2 
a. Niyi; wu ‘adi 'utux-ta ka kiya;, 
PROX PRT NEG God=1PI.GEN SBJ it 
“This, it is not our God; 
b. "utux 'amirika ka kiya;. 
God America sBJ it 


‘it is the American God" 


The preposed NP of (57a), kari niyi “this story' is the patient of the predicate verb of 
clause (57b), m-iyah t<em>egesa ‘come to teach, but it is not overt. The score is therefore 
3. 


(57) Zero pronoun use; score of 3 
a. Kari niyi; 'u n-eyah-an-na m-angal pa'ah 'alang Besuring. 
story PROX PRT PRF-come-GV-35G.GEN Av-take from village Besuring 
‘As for this story [=the gospel], it was taken from Besuring village’ 


b. Pekelug m-iyah t<em>gesa__; hiya ka Tiwang wal. 
just Av-come <Av>teach there spy Ciwang Iwal 


“Ciwang Iwal came to teach it just then’ 


5.2 Results 


For preposed subjects, the topic persistence score is 0 for 35%, 1 for 23%, 2 for 33%, and 
3 for 9%. The incidence of score 3 is rather low. It is summarized in Table 10. 


Table 10: Persistence of preposed subjects 


Preposed subject % 
0 (=no persistence) 23 35% 
1 (=repeated NP) 15 23% 
2 (=pronominal use) 22 33% 
3 (=zero pronoun) 6 9% 
Total 66 100% 
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5.3 Comparison 

In this section I will compare the topic persistence scores of preposed NPs with and 
without overt matrix subjects, and also with matrix subjects. 

5.3.1 When the matrix clause subject is and is not overt 


A comparison of topic persistence of preposed subjects depending on the presence or 
absence of overt matrix subjects is given in Table 11. 


Table 11: Persistence of preposed subjects with and without overt matrix sub- 
jects 


Preposed subjects 


S'u VX ka hiya. S'u,VX. 


0 (=no persistence) 2 1% 21 40% 
1 (=repeated NP) 3 23% 12 23% 
2 (=pronominal use) 8 62% 14 26% 
3 (=zero pronoun) 0 0% 6 11% 
Total 13 100% 53 100% 


Topic persistence differs considerably depending on the presence or absence of an 
overt matrix subject. 

62% (8 out of 13) of the instances of the S of the S ’u, VX ka hiya pattern showed a score 
of 2, but only 26% (14 out of 53) of the S of the S 'u, VX pattern. In contrast, 15% of the 
instances (2 out of 13) of the S of the S ’u, VX ka hiya pattern showed a score of 0, while 
40% of the S of the S ’u, VX (21 out of 53) showed a score of 0. There was no instance of 
an overt matrix subject (S ’u, VX ka hiya) that showed a score of 3; the six instances of 
score 3 (11%) are all S of S ’u, VX. For S of S ’u, VX ka hiya, the proportion of score 2 (62%) 
is the highest. For S of S ’u, VX, the proportion of score 0 (40%) is the highest. 


5.3.2 Preposed subject and matrix subject 


Now let us compare preposed subjects with matrix subjects. This is shown in Table 12 
and Figure 4. 

We can see that the matrix subject shows a tendency quite similar to S of S 'u, VX. 
In many instances of both types (45% and 40%, respectively), the referent is referred to 
only once in a single clause (for a score of 0). In contrast, many instances of S of S ’u, VX 
ka hiya (double reference) show a score of 2 (referred to again by a pronoun within the 
following five clauses). The doubly referenced entity somehow tends to persist longer. 
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Table 12: Persistence of preposed subjects with and without overt matrix sub- 
jects 


Preposed subjects Matrix subjects 
S’u, VX ka hiya. S'u,VX. VX ka S. 
0 (=no persistence) 2 15% 21 40% 117 45% 
1 (=repeated NP) 3 23% 12 23% 51 19% 
2 (=pronominal use) 8 62% 14 26% 65 25% 
3 (=zero pronoun) 0 0% 6 1% 29 1% 
Total 13 100% 53 100% 262 100% 
709 
60% 
509 
4096 
3096 
20% 
10% 
09 
no persistence repeated NP pronominaluse zero pronoun use 
S'u, VX ka hiya. S'u, VX. VXka 


S. 


Figure 4: Persistence of preposed subjects with and without overt regular- 
position subjects 


5.4 Correlation between NP types and topic persistence 


Let us see the correlation between NP types and topic persistence, summarized in Ta- 
ble 13. 
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Table 13: NP types and Topic pessistence 


Topic persistence 


0 1 2 3 
PP 0 0 2 0 
D 2 1 2 1 
PD 2 2 6 2 
CD 1 1 1 1 
BPN 5 8 6 0 
CG 4 0 1 1 
CR 2 1 2 0 
CN 2 2 0 0 
Saw+ noun 4 0 2 1 
VP 1 0 0 0 
Total 23 15 22 6 


As for topic persistence, the tendency is not as clear as in the case of anaphoricity. We 
can say that CR, CG, CN and saw + noun, which are supposed to be at the lower position 
in the givenness hierarchy of Gundel et al. (1993), the topic persistence score tends to be 
low (score of 0). 


6 Summary 


I examined the semantics/function of NP type, anaphoricity, and topic persistence in 
preposed NPs. 

Semantically, more than two thirds of the preposed NPs are coreferential with the 
matrix subject. One sixth of them are preposed Time. 

Preposed subjects, As, and possessors can be regarded to indicate aboutness topics, 
and the matrix clause denotes the comment about them. Preposed time, alternative, and 
those preposed NPs that are not classified as any of the above seem to set a frame to the 
information provided by the matrix clause. 

As for NP types, this paper pointed out several characteristics of preposed NPs. We can 
say that preposed subjects tend to be proper names. About half of the preposed subjects 
are bare proper names (BPN) or proper names modified by a demonstrative (PD). Only 
28% of the matrix subjects are BPN or PD, so BPN or PD is a characteristic of preposed 
subjects. 

Another characteristic of the NP types of preposed subjects is that personal pronouns 
rarely appear preposed. Only 3% of preposed subjects are personal pronouns, but 20% of 
matrix subjects are personal pronouns. 
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We can point out one characteristic of S of the S *u, VX ka hiya pattern: common 
nouns rarely appear. When the matrix subject is overt, there are very few instances (7%) 
of common nouns (bare CN), modified by a demonstrative (CD), by another noun or a VP 
(CR), or by a genitive pronoun (CG). For S of the Su VX pattern, one third of preposed 
subjects are CN, CD, CR, or CG. For matrix subjects, the rate of CN, CD, CR, and CG is 
higher; it is about 40%. 

As for anaphoricity, there is no drastic difference between S of the S wu VX ka hiya 
pattern, S of the S ’u, VX pattern, or matrix subjects. Matrix subjects showed slightly 
more (57%) high scores (2) than preposed subjects (53%). Half of the instances of S of 
the S vu VX ka hiya pattern (50%) showed a score of 0, which is the highest proportion 
among the three. 

This paper also examined coreference between types of NPs and anaphoricity. Dif- 
ference in anaphoricity between PD and BPN seems worth noting. Most of the PD are 
already mentioned in the previous discourse, while BPN are not necessarily so. Also, PP 
and D are all mentioned in the previous discourse. 

As for topic persistence, preposed S of the S 'u, VX pattern and matrix subjects showed 
similar tendencies (Table 12 in 85.3.2). Totally, 40% or 45% showed a score of 0, 23% or 
19% showed a score of 1, 25% showed a score of 2, and 12% or 11% showed a score of 3. S 
of the S ’u, VX ka hiya pattern, on the other hand, showed a different tendency: only 14% 
showed a score of 0, 21% showed a score of 1, and 64% showed a score of 2. There are no 
instances of preposed subjects with overt matrix subjects that showed a score of 3. 

To summarize, we can say that S of the S 'u, VX pattern and matrix subjects showed 
similar tendencies except for NP types. What type of NP it is determines whether it 
appears in preposed position or in regular subject position. S of the S 'u, VX ka hiya 
pattern, on the other hand, is used to give further information about a proper name. It 
tends to persist longer in the following clauses. As for anaphoricity, there is no drastic 
difference among the three. 
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Abbreviations 

ANGER anger CNJ conjunction 

AV actor voice Cv conveyance voice 

BPN bare person name CR common noun modified 

CG common noun modified by another noun or VP 
with Genitive pronoun D demonstrative 

CN common noun DIST distant 
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EX exclusive by demonstrative 
FUT future PL plural 

GEN genitive PO Possessor 
GENTLE gentle PP personal pronoun 
GV goal voice PRF perfective 

GV1 goal voice 1 PROX proximant 

GV2 goal voice 2 PRT particle 

HEARSAY hearsay REASON reason 

IN inclusive SBJ subject 

LNK linker SG singular 

NEG negative UNCERTN uncertain 

NFIN non-finite 1 first 

NOM nominative 2 second 

RDPL reduplication 3 third 

PD person name modified 
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Chapter 11 


Some preliminary observations on 
prosody and information structure in 
Austronesian languages of Indonesia 
and East Timor 


Nikolaus P. Himmelmann 


Universitat zu Köln 


This chapter provides a brief overview of what is known about prosody and information 
structure in the Austronesian languages of Indonesia and East Timor. It emphasizes the fact 
that the prosodic systems found in these languages appear to differ substantially from the 
better known systems found in languages such as English and German and finds that to date 
there is little evidence that prosody plays a major role in conveying information-structural 
distinctions. Of major import in this regard appears to be the fact that many Austronesian 
languages in the area appear to lack lexical stress as well as lexical tone. Consequently, 
intonational phrases lack (postlexical) pitch accents, the tonal inventory being restricted to 
a smallish number of edge tone combinations on the intonational phrase level plus a single 
boundary tone on the level of intermediate phrases. The chapter concludes with a brief 
discussion of a major exception to these generalisations, i.e. the group of (Austronesian) 
West New Guinea languages that show a bewildering variety of tonal and stress-related 
distinctions. 


1 Introduction 


Apart from some varieties of Malay, the prosodic systems found in Indonesian and East 
Timorese languages have not been investigated in detail to date. Still, from what is known 
from Malayic varieties! and the few studies on other languages that have been published 
(in particular, Stoel 2006 on Javanese and Himmelmann 2010 on Waima’a) it seems likely 
that prosodic PROMINENCE does not have a major role to play in marking information- 
structural categories. If at all, prosodic PHRAsING may be of relevance in this regard 
inasmuch as it is not determined by syntactic or processing constraints. 


1See Riesberg et al. (2018 [this volume]) for a brief summary of the relevant literature on Malayic varieties. 
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Current ideas on the prosodic marking of information-structural categories, in particu- 
lar focus and activation status (i.e. the distinction between given, accessible and new dis- 
course referents), are based on, and heavily biased towards, what is found in some west- 
ern European languages, in particular the West Germanic languages English, German 
and Dutch. From a cross-linguistic point of view, the prosodic marking of information- 
structural categories in these languages is quite unusual and does not provide a good 
starting point for investigating the relationship between prosody and information struc- 
ture in Austronesian languages of Indonesia. Rather, as it is argued here, it will be more 
productive to start with much simpler assumptions and only take on board more complex 
prosodic features, if the data require them. 

Paradoxically, it will be useful to look at the basic ingredients of West Germanic sys- 
tems in order to make clear what is meant by “simpler assumptions”. Consequently, 
82 briefly lists the essential features of a West Germanic system. 83 presents a gen- 
eral proposal for a stepwise build-up of prosodic systems, not necessarily confined to 
the languages under investigation. It starts from the most minimal assumptions about 
prosodic phrasing and stops at the level of complexity that appears to be widespread in 
the languages of Indonesia. $4 and $5 introduce complications to the relatively simple 
prosodic system sketched in 83. $4 is concerned with the further subdivision of intona- 
tional phrases (IPs) into smaller (lower-level) prosodic phrases, while 85 briefly looks 
at languages in eastern Indonesia where highly unusual word-prosodic systems are at- 
tested. $6 concludes. 

The exposition is couched in the terminology and formalisms used in the 
autosegmental-metrical framework for prosodic analysis (Ladd 2008), and more specif- 
ically the Tone and Break Indices (ToBI) framework (Beckman et al. 2005). This frame- 
work is chosen because it is the currently most widely used and understood approach 
to prosodic analysis, and there are a number of cross-linguistic studies which make use 
of it (see for example the two volumes edited by Jun 2005; 2014). But the current argu- 
ment does not depend on the autosegmental framework and can, in principle, also be 
expressed in other frameworks for prosodic analysis. 

The limitation to Austronesian languages of Indonesia and East Timor is arbitrary in 
the sense that there are Austronesian languages outside this area that may show simi- 
lar characteristics, in particular the ones in Brunei and Malaysia. However, the author 
is not sufficiently familiar with these other languages to be able to make useful obser- 
vations with regard to western Austronesian languages more generally, not to mention 
Oceanic languages. This, in fact, also holds for the Indonesian part of Borneo, known 
as Kalimantan, where the generalizations put forward here possibly do not apply. The 
prosodic systems found in the languages of the Philippines very likely differ in important 
regards.” 


"Thus, for example, Blust (2013: 175) notes: “The most distinctive typological feature in the sound systems of 
Philippine languages is the widespread occurrence of phonemic stress.” The Sangiric languages in northern 
Sulawesi may show related contrasts. See Himmelmann & Kaufman (2018) for a more detailed assessment 
of this claim. 
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2 The West Germanic "prototype" 


Abstracting away from many details, the phonological structure of an intonation phrase 
(IP) in West Germanic can be represented as in Figure 1. Using the conventions of the 
ToBI framework, the T here represents tonal targets which can be either H(igh) or L(ow). 
These targets can be anchored either to the edge of an IP (hence edge tones) or to a 
metrically strong syllable (represented by a bolded c in Figure 1).? The latter are widely 
referred to as (postlexical) pitch accents. The difference between the two edge tone types, 
i.e. boundary tones and phrase accents, is further explained in 83. 


%T T* TT TT% 
4 + 1 4 


[cocococooooooc] 


Metrically-anchored tones 

T* = pitch accent (monotonal) 
T*+T = pitch accent (bitonal) 
Edge tones 

T- = phrase accent 

T% = IP boundary tone (final) 
%T = IP boundary tone (initial) 


Figure 1: Phonological structure of a West Germanic Intonation Phrase 


The occurrence of (postlexical) pitch accents presupposes lexical stress, i.e. the phono- 
logically organized highlighting of a syllable relative to adjacent ones by way of modu- 
lating phonetic parameters such as pitch, intensity and duration. It is doubtful that all 
languages have lexical stress in this sense,’ and as further detailed in 83.2, this appears 
to be the case for most of the Austronesian languages of Indonesia and East Timor. Note 
also that even if all languages had lexical stress, it could be the case that the intonational 
system is organized independently of it. Thus, for example, Lindstróm & Remijsen (2005) 
claim for the Papuan language Kuot, spoken on New Ireland, that it is “a language where 
intonation ignores stress" (Lindstróm & Remijsen 2005: 839). In fact, it may be the case 
that the occurrence of postlexical pitch accents of the West Germanic type is rather rare 
cross-linguistically. 

Most research on intonation, especially in the last three decades, has focussed on 
(postlexical) pitch accents and their function in marking information-structural cate- 


3 Actually, the division is not as straightforward as it is made out to be here. The placement of edge tones, in 
particular phrase accents, may also make reference to metrically strong syllables, as discussed in the Grice 
et al. (2000). 

^The term stress is used throughout this chapter in exactly this sense. Note that stress as widely used in 
the literature often includes other notions such as regular rhythmic alternations between strong and weak 
syllables (foot structure). Claiming that Austronesian languages in the area under discussion usually do 
not make use of lexical stress hence does not necessarily imply that they do not make use of foot structure 
or other word-level prosodic distinctions. 
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gories, in particular focus and activation status (cf. Ladd 2008; Wagner & Watson 2010; 
Baumann & Kügler 2015; Zimmermann 2016 for recent reviews). While this makes sense 
with regard to West Germanic and possibly other European languages, it may be coun- 
terproductive simply to transfer this model to other languages, as further argued in the 
following section. 

The major emphasis on postlexical pitch accents goes hand in hand with prioritiz- 
ing information-structural categories among the three main functions that intonational 
marking may serve. The other two main functions are marking sentence mood (declara- 
tive vs. interrogative, etc.) and delimiting phrases on various levels (phrasing or chunk- 
ing function). While the sentence mood marking function also has received consider- 
able attention in the investigation of European languages, the phrasing function has not 
played a very prominent role. ? 


3 Start simple! 


When looking at prosody in Austronesian languages of Indonesia (and perhaps also in 
many other parts of the world), it is useful to start with the simplest possible assumptions 
regarding a prosodic system rather than with the complex model provided by the West 
Germanic languages. 


3.1 Lexical level 


With regard to the lexical level, the simplest assumption would be that there is neither 
lexical stress nor lexical tone (including so-called lexical pitch accents). In the literature 
on Austronesian languages, as conveniently summarized in van Zanten et al. (2010), it 
has been widely assumed that these languages, including the ones spoken in Indonesia 
and East Timor, have lexical stress systems of various kinds, with a strong preference for 
stress to occur on the penultimate syllable. However, as van Zanten et al. remark in $4 
of their survey (van Zanten et al. 2010: 99-102; see also van Heuven & van Zanten 2007a: 
194), there are good reasons to doubt that the prominence phenomena discussed in the lit- 
erature actually belong to the lexical level rather than to the phrasal level. As we will see 
further below, what has often been described as regular penultimate stress is in fact the 
regular occurrence of a rising-falling edge tone combination at the end of intonational 
phrases. The classic example is Standard Indonesian as spoken in Java, to which a wide 
variety of stress systems have been attributed, but where there is solid evidence that it 
actually lacks lexical stress (see Goedemans & van Zanten 2007 for a summary of the 
relevant research). The work by van Heuven and colleagues shows that this holds true 
for production (acoustics) as well as perception. With regard to the latter, van Zanten 


5Féry (2013) argues that prosodic phrasing is actually more relevant for focus marking than prosodic promi- 
nence, hence questioning the emphasis on prosodic prominences in intonation research. However, the 
marking of information structure (in particular focus) is still considered the primary function of prosody 
in this line of argument. In line with much of the literature, it is assumed here that the phrasing function 
of prosody is not necessarily related to information-structural distinction. This does not preclude the pos- 
sibility that phrasing sometimes may be indicative of focus domains. This issue is taken up again at the 
end of $3.2. 
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& van Heuven (1998) report a gating experiment which shows that Indonesian listen- 
ers were unable to make use of prosodic information in predicting word endings, unlike 
Dutch listeners who performed much better on the same (Indonesian!) stimuli. 

Goedemans & van Zanten (2014) go a step further. Reflecting on the experiences 
accrued in over two decades of compiling and maintaining a database on stress systems 
attested in the world's languages (StressTyp) and in particular the fact that in more 
recent years a number of languages initially classified as having lexical stress had to be 
reclassified as having no stress, they propose the following list of criteria (or indicators) 
for descriptions of prominence phenomena where the proposed analysis as lexical stress 
is doubtful (Goedemans & van Zanten 2014: 88): 


1. Stress is reported to vary in different utterances of the same word 

2. Stress is reported to be a phenomenon related to phrases 

3. Stress is reported to be very weak and unstable 

4. Fundamental differences in stress use of various speakers reported 

5. The reported stress rule makes no sense in any current metrical theory 


The following three indicators are considered not to raise suspicion on their own but to 
strengthen doubts in case at least one of the above indicators holds true (Goedemans & 
van Zanten 2014: 88): 


6. Position in the intonation contour influences stress location in an unexpected way 
7. Numerous exceptions to the rule are reported 
8. Only schwa in penultimate position rejects stress 


A quick glance at the usually terse sections on stress in many a description of an Aus- 
tronesian language makes it clear that more often than not several of these indicators 
apply and that therefore it is not advisable to assume the existence of lexical stress in a 
particular Austronesian language without further validation. 

As for the Austronesian languages of Indonesia, the existence of lexical stress has been 
properly demonstrated only for very few of them. All of these languages are spoken in 
the easternmost part of the archipelago, in the Indonesian part of New Guinea and sur- 
rounding islands (henceforth simply called Papua in this chapter), as further discussed 
in $5 below. 

Note also that it may be the case that there is sufficient evidence for stress-like dis- 
tinctions in some languages of Sulawesi, in particular Central Sulawesi languages, as 
briefly discussed in Himmelmann & Kaufman (2018). Utsumi (2011) claims that in Bantik, 
a Sangiric language of northern Sulawesi, lexical pitch accents regularly occur on either 
the penultimate or the ultimate syllable (hence having a distinctive function). 
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3.2 Postlexical level 


With regard to the postlexical level, the simplest assumption - apart from no use of 
prosody° - would be something like the structure depicted in Figure 2. It shows a 
string of syllables which is separated from adjacent strings of the same type by melodic 
and rhythmic cues. Typical rhythmic cues are lengthening the final syllable of the 
string and pausing. The basic melodic cue pertains to the fact that syllable strings in 
natural languages are produced with a coherent melody, one string being delimited 
from the preceding and following ones by the on- and offsets of a coherent pitch 
contour. Typically, there is a noticeable jump in pitch (up or down) between the offset 
and the next onset. Additional optional cues include non-modal voice quality such as 
devoicing at the end of the string or the occurrence of (non-phonemic) glottal stops at 
the beginning. See Himmelmann et al. (2018) for further discussion and experimentation. 


Coherent melody 
[cocoococococooc] (final lengthening, pause etc.) 


Figure 2: A minimal prosodic structure 


The minimal structure in Figure 2 can serve a basic phrasing function inasmuch as the 
units thus delimited are useful processing units (in terms of planning and/or comprehen- 
sion). As such, their size would not be primarily determined by the exigencies of airflow 
management (breathing), but rather by other factors such as semantic and pragmatic con- 
siderations of information packaging. Thus, for example, Chafe (1994: 108-119) proposes 
that intonational phrases (intonation units in his terminology) are designed to introduce 
one new idea (one piece of new information) at a time into the ongoing discourse." Such 
a function can easily be served by the units in Figure 2. To serve this function, there is 
no need to highlight the new information in some way, or to distinguish different types 
of boundaries. The only requirement is that such a unit never contain more than one 
piece of new information.? An alternative, though partially overlapping, hypothesis for 
the functional basis of the kind of unit depicted in Figure 2 is the idea that it corresponds 
to speech acts, i.e. each speech act is packaged as one prosodic unit.? 


5*No prosody” would mean only purely physiologically conditioned variation in pitch and chunking of 
speech production. Speech chunks would then be completely determined by breathing requirements, with 
no regard for content or structure, each chunk probably starting on a relatively high pitch and gradually 
declining till the end of the unit. Lieberman's (1967) model of intonation is considerably more refined, but 
is based on a model of speech physiology which would roughly produce this kind of output, if the speaker 
were not allowed to control and thereby modulate the basic physiological necessities. 

7A very similar proposal is Pawley & Syder's (2000) one-clause-at-a-time hypothesis. 

5Obviously, the validity of Chafe's one-new-idea constraint depends on being able to provide an indepen- 
dent and operationalizable definition of how to identify one piece of new information. As this hypothesis 
only serves as an illustration of what kind of function the minimal structure in Figure 2 has, there is no 
need here to get into this quite complicated issue. 

"This idea is suggested by the widely recognized sentence mood marking functions of intonation mentioned 
above. See Cresti (1996) for further elaboration. 
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A first maximally simple enrichment of the minimal structure in Figure 2 would be 
the addition of one type of tonal target, i.e. boundary tones, as shown in Figure 3. 


%T T% 
i * 


[coocoocoocoocoo] 


%T, T% = boundary tone(s) (initial, final) 


Figure 3: Minimal tonal prosodic structure (boundary tones only) 


In terms of function, units with the structure in Figure 3 would be capable of mark- 
ing sentence mood-like distinctions in addition to the very basic information packaging 
function served by units of the type in Figure 2. 

Further enrichment of the basic structure in Figure 3 would include the addition of a 
so-called phrase accent, i.e. a pitch target anchored to the edge of the phrase, but not 
necessarily to the very final segment. 


T-T% 
++ 


[coocoocoocoocoo] 


Edge tones 
T% = boundary tone 
T- = phrase accent 


Figure 4: Enriched minimal tonal prosodic structure (edge tones only) 


The term phrase accent has been used for at least three, prima facie different kinds of 
phenomena: 


1. fora postnuclear prominence occurring at the right edge of an intonational phrase 
which is part of a complex phrase-final edge tone and typically anchored to a 
metrically strong syllable (ToBI labels T-T%; cp. Grice et al. 2000; Ladd 2008: 142- 
147); 


2. for the boundary tone of a so-called intermediate phrase, ie. a phrase that is 
smaller than an IP but larger than a prosodic word (cf. Beckman & Pierrehumbert 
1986 and much subsequent work); 


3. for a major pitch excursion occurring at the right or - much more rarely - the left 
edge of an intonational phrase in some languages which otherwise do not appear 
to make phonological use of pitch changes, i.e. lacking lexical tone distinctions 
as well as postlexical pitch accents (e.g. French, Indonesian, Waima'a). This pitch 
excursion may, or may not, be accompanied by extra duration. 
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It is a matter for further research to determine whether these three phenomena have 
enough in common to warrant subsumption under a common notion. Alternatively, we 
are dealing with three phenomena which share the positional feature of edge placement 
but otherwise have different properties. Part of resolving this issue will be the question of 
which functions phrase accents serve. Type 2 phrase accents appear to have a boundary- 
marking function but is this also true for the other two types, both of which are followed 
by boundary tones proper? And, inasmuch as phrase accents have a boundary-marking 
function, what is the function of the units thus delimited? 

Many Austronesian languages of Indonesia seem to have prosodic units of the basic 
type shown in Figure 4. The available descriptions report differences with regard to the 
exact placement of the phrase accents. Common options include: 


e ‘free’ variation within a 2-syllable window (e.g. Javanese); 


e mostly penultimate syllable, but sometimes also on ultima (e.g. Manado and 
Papuan Malay, Waima’a). 


It is unclear whether these reported differences are actually factual differences. Alter- 
natively, they arise from different analytical procedures and theoretical frameworks. 
With regard to Ambon Malay, Maskikit-Essed & Gussenhoven (2016) provide produc- 
tion evidence for an analysis which considers the configuration depicted in Figure 4 as 
free floating boundary tones not anchored to segmental landmarks in the same way as 
phrase accents in European languages (including, in particular, French). Although the 
pre-boundary pitch movement distinguishing this configuration from simple boundary 
tones as in Figure 3 is often perceived by Western researchers as being anchored to ei- 
ther the penultimate or the ultimate syllable, various measurements indicate that — at 
least in the case of Ambon Malay - the position of the peak of this pitch movement is 
highly variable and correlates much less strongly with potential segmental landmarks 
than typical European postlexical accents. 

It is not clear whether this analysis for Ambon Malay also applies to other languages, 
which to date have not been investigated to the same degree of detail as Ambon Malay. 
Only such more detailed analyses will show whether the different descriptions reported 
above correspond to factual differences. In the remainder of this chapter, we will con- 
tinue to use the term phrase accent to refer to the configuration in Figure 4, with the 
understanding that the details of the analysis, and in particular the specifics of tune-text 
association, are yet to be worked out. 

In the current context, the question of what functions phrase accents may have in 
marking information structure is a major concern. In languages with relatively fixed 
word order, it is unlikely that the phrase accent directly marks information-structural 
categories such as focus, as its position is constrained to a relatively small window (usu- 
ally two syllables) at the IP edge. The text occurring in this window is often just a single 
word (or part thereof) or the final syllable of a content word plus a phrase-final parti- 
cle. Hence the phrase accent occurs on the word/word - particle that happens to be in 
edge position, regardless of its information-structural status. Insofar as speakers do not 
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have a choice with regard to determining which word occurs in edge position, they do 
not have a choice to determine which word (or syllable) is “highlighted” by the phrase 
accent. Compare the two examples from Papuan Malay in (1). 


(1) Papuan Malay (elicited) 


a. baju 
shirt 
‘shirt’ 

b. baju mera 


shirt red 
‘red shirt’ 


š eee Ae — tumi Mim 
165 
140} : / NA Ñ JN 
1204 . ux A ZS N 
S VI Wa 
š 100 dE: N 
š N 
75 | | 1 
H- L% H- L% 
l l 
ba ju ba | ju | me ra 
0 3.804 
Time (s) 


Figure 5: F0 and waveform for ex. (1a) ? 


As Figure 5 shows, the phrase accent remains at the right edge when another word is 
added to the phrase. That is, in a phrase such as baju mera the phrase accent cannot occur 
on baju, but necessarily occurs on mera, because the order of these two constituents 
cannot be changed. Consequently, in languages which make use of a phrase accent and 
have fixed word order in at least some phrase types, it is prosodically impossible to mark 
a difference in (contrastive) focus of the kind seen in English BLUE car vs. blue car. This 
is nicely illustrated by the following example provided in Stoel (2007), which comes from 
a corpus of Manado Malay spontaneous narrative speech. 


1? AI] acoustic analyses presented in this chapter were carried out and plotted with PRAAT (Boersma & 
Weenink 2015). 
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(2) Manado Malay (Stoel 2007: 121) 
dorang mo cari tiga: - tiga oto (=) cuma: (0.4) kita cuma dapa satu oto 
3P ASP look.for three three car only is only get one car 
'(We planned to go to the wedding party by car.) They were looking for three ... 
three cars. But... I only got one car.’ 


° 
š F E: ët teg we 
É 
260 
/N ~ 
ol P UN ARA d 
= "Ni N. š ke A cae dE 
= Le 3 VOX 
= V ` : b 
5 
E 1504 
130 T T T T T X 
H L96 H- L% H. L% 
l l l 
; V 
ti | ga} o to | cu ma: 0.4 ki | ta [cu malda| pat | sa | tu | o to: 
0 2.732 
Time (s) 


Figure 6: F0 and waveform for example (2) H 


Despite the fact that in example (2) tiga ‘three’ is contrasted with satu ‘one’, the phrase 
accent occurs on oto in both instances, because this word takes up the final two syllables 
in both IPs. 

The preceding examples should also make it clear why so many descriptions of Aus- 
tronesian languages of Indonesia contain the assertion that lexical stress mostly/always 
occurs on the penultimate syllable. In elicitation, lexical items tend to be produced with 
the most unmarked declarative intonation pattern, which in many languages includes 
a phrase accent heard by the researcher to be located in the penultimate syllable, as il- 
lustrated by (1a). But when occurring in non-final position in larger structures as in (1b), 
more often than not no trace of this presumed lexical stress can be detected. 

This also brings us back to the prosodic structure shown in Figure 1, which differs 
from the one in Figure 4 in that it includes metrically-anchored tonal targets in addition 
to edge tones. As already noted in 82, metrical anchoring presupposes lexically-based 
prominence differences (i.e. lexical stress) which specify the syllables that may serve as 
anchors for (postlexical) pitch accents. (Postlexical) pitch accents are a major way to 
prosodically mark the information status of individual words and phrases. Hence, the 
fact that prosody only plays a limited role in the expression of information structure in 


"Only IP edge tones are tonally annotated here. The analysis of the rise across kita cuma is discussed in the 
following section. 
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many Austronesian languages of Indonesia is related to the lack of lexical stress (lexically 
encoded prominence differences) in many of these languages. 

However, as shown by many languages around the globe including many African lan- 
guages and Korean,12 (postlexical) pitch accents may not be the only prosodic means to 
mark information-structural categories. Prosodic (re- or de-) phrasing may also serve 
this purpose. A classic and much discussed example comes from the Bantu language 
Chichewa as analysed by Kanerva (1990). While broad focus utterances tend to be pre- 
sented in single prosodic phrases, narrow focus on one of the constituents requires the 
insertion of a prosodic boundary after the focussed word, as can be seen when comparing 


(3a) with (3b): 
(3) Prosodic rephrasing in Chichewa(Kanerva 1990: 98) 

a. (a-na-ményá nyu" bá "dí m<wáála)! 
1.SBJ-RECENT.PAST-hit 9.house with 3.rock 
‘He hit the house with a rock.’ (answering: What did he do?) 

b. (anaméenya) (nyuti™ ba) ("di mwáála) 
‘He hit the house with a rock.’ (answering: What did he do to the house with 
the rock?) 


In fact, Féry (2013: 683) proposes "that the most common prosodic realization of focus 
can be subsumed typologically under the notion of alignment: a focussed constituent is 
preferably aligned prosodically with the right or left edge of a prosodic domain the size 
of either a prosodic phrase or an intonation phrase” 

This view would appear to contradict our assessment above that phrase accents, which 
occur at the right edge of IPs, do not mark information-structural categories for the sim- 
ple reason that most Austronesian languages of Indonesia do not freely allow to move fo- 
cussed words into this position. While speakers thus cannot simply move words around 
so that they occur in IP-final position, speakers have great freedom in determining the 
size of an IP. So, in principle, there is the possibility to bring a focussed word into IP-final 
position by inserting a prosodic boundary behind it (similar to the Chichewa example 
(3)). However, this possibility does not appear to be systematically used in Austronesian 
languages of Indonesia. As we will see in the following section, there is some variability 
with regard to the placement of the phrase accent (at least in Manado Malay), but it is 
not the case that the word occurring in IP-final position always belongs to the focus do- 
main. Furthermore, most instances of narrow focus are not signalled by inserting an IP 
boundary. Thus, looking again at example (2), there is no example known to this author 
where an IP boundary is inserted after a numeral in narrow contrastive focus (here satu), 
separating it from the following non-focussed nominal head of the phrase (here oto). 

However, IP boundaries are not the only kind of prosodic boundary. In the literature, a 
fairly heterogeneous group of phrase types is distinguished in between the phonological 


12See Jun (1998) for a detailed study of Korean dephrasing. 
PMorpheme breaks and glossing from Downing & Pompino-Marschall (2013: 651). Numbers refer to noun 
classes. 
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word and the intonational phrase, these two levels being uncontroversially recognized 
in all frameworks and widely believed to be found in all languages (but see Schiering 
et al. 2010). Thus, for example, Kanerva (1990) speaks of focus phrases in reference to the 
smaller kind of phrases seen in example (3b) and argues that these are one levelbelow the 
intonational phrase in Chichewa. Consequently, to provide a reasonably comprehensive 
survey of the relation between prosodic phrasing and information structure in Austrone- 
sian languages of Indonesia, we have to see whether there is evidence for phrasing units 
smaller than IPs but larger than phonological words. This will be the topic of the next 
section. 


4 Where things get more complicated 1: Intonation 
(postlexical prosody) 


The structure given in Figure 4 above is not the whole story for the Austronesian lan- 
guages of Indonesia disposing of this basic type of prosodic structure. In addition to the 
pitch modulation occurring in a two syllable window at the right edge of an IP, which 
here is interpreted as a phrase accent, there tend to be further pitch rises earlier on in 
an IP which are not accounted for by Figure 4. Example (4) from Totoli illustrates this. 


(4) Totoli (elicited) 
[[ Ali] anu nangaan-ko] 
PN REL AV.RLS:eat=AND 


“Ali was the one who ate it.’ (Answering the question ‘who ate the banana") 


o 
E 
z 
245 
2004 
— A 
D 1504 V Pai be 
5 X ~ 
E —Ó 
& 
93 | | | 
H L- H% 
l l l 
ij a |lila|nu| na ngaan ko 
0 1.797 
Time (s) 


Figure 7: F0 and waveform for example (4) 


That is, IPs can be prosodically chunked into smaller units which do not interrupt the 
melodic and rhythmic coherence of the larger IP. These smaller chunks here are called 
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intermediate phrases (ip) and the boundary tone that delimits their right edge is repre- 
sented by H$ in the tonal tier. They exhibit the following features throughout the area 
of investigation: 


* The major boundary marker for ips is a H(igh) tone on the unit-final syllable, the 
peak usually being located at the very end of it. This syllable is not markedly length- 
ened or otherwise prosodically highlighted in addition to bearing the boundary 
tone. In theory, the boundary marker for ips could also be a L(ow) tone, but in 
all the Austronesian languages of Indonesia and East Timor the author is familiar 
with, it is always H. 


e In case there are two or more consecutive ips in an IP, the unit-final Hs tend to be 
downstepped. However, downstepping does not regularly include IP-final phrase 
accents and boundary tones, i.e. an IP-final high phrase accent or boundary tone 
is often (but not necessarily) higher than any of the preceding H$ targets. Example 
(4) illustrates this for a final H-boundary tone (H%). 


e No pauses or other rhythmic boundary markers may occur at an ip boundary. 


Similarly, there is no interruption of the overall pitch contour (i.e. no offset-onset 
phenomena). However, there is always a tonal reset in that the beginning of the 
following unit always involves a (consecutive) fall to a lower pitch level. This ip- 
initial low(er) target is often reached within the first syllable of the following ip, 
but it may also occur somewhat later (2"4 or even 3“ syllable). Cf. the discussion 


of example (5) below. 


e While the rise towards the final H may begin earlier on in the unit, the penultimate 
syllable is not prosodically highlighted in any special way. 


To date, the details of the more fine-grained prosodic structure of the ip in Aus- 
tronesian languages of Indonesia are not yet well understood. There appears to be 
considerable variability both within and between languages. Stoel (2005; 2007) observes 
that in Manado Malay pitch tends to continuously rise across the ip, therefore postu- 
lating an initial L$-boundary tone. Figure 8 provides an abstract representation for the 
resulting structure. 


The analysis shown in Figure 8 is of course only one of a number of different possible 
analyses for the observed state of affairs. A more standard ToBI analysis, for example, 
would not make use of a special symbol ($) for ip boundaries, but analyse these as phrase 
accents (T-) as well, i.e. conflating the 2"¢ and 3" meanings of “pitch accent” distin- 
guished in reference to Figure 8 above. A major reason for this alternative analysis is 
Selkirk’s (1984) Strict Layer Hypothesis which predicts that each layer on the prosodic 
hierarchy is exhaustively parsed into constituents of the same type on the next lower 
level. Thus IPs should exclusively consist of ips. But in in Figure 8, the IP is parsed into 
two ips plus a third unit of an apparently different status rather than into three ips. The 
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L$ H$ L$ H$ TT 
+ + + 1 ++ 


[[oooco]plooocooco]ipooco]rp 


T$ = ip-boundary tone 
T% = IP boundary tone 
T- = IP phrase accent 


Figure 8: The intermediate phrase (ip) 


major reason for not following the more mainstream analysis here is that it is not clear 
that the boundary tones of ips and the phrase accent, which is a part of the edge tone 
combination marking IPs, really are similar enough to be considered tonal targets of the 
same type. We come back to this issue at the end of this section after providing more 
detail on the form and function of ips. 

Intermediate phrases with the structure in Figure 8 are also found in East Timorese 
Waima'a. However, in Waima'a, ips are often essentially flat, the peak of the H$ rise 
being followed by a short fall back to the base line, as seen in (5). 


(5) Waima'a ([pesawat 41]^) 
ne kara data naha barse ne whaka ige la rihu ne'i wake nin(i) 
3s want alight if ^ seem 3s fly PTLLOCfog PRX below Poss 


‘if it were about to land, then it should fly below the clouds’ 


There are various possibilities for analysing the pitch trajectory in the two ips seen in 
Figure 9, including also an initial L(ow) boundary tone (the difference between the struc- 
ture in Figure 8 and the one seen in Figure 9 would then have to be captured by different 
specifications for phonetic implementation). Alternatively, one could analyse this config- 
uration as involving a final HL$ boundary tone, with the low target usually being reached 
on the first or second syllable of the following unit. This is not the place to argue one 
or the other solution. The important point to keep in mind is that despite considerable 
variability regarding the details of the pitch contour, what all ips have in common is that 
there is an H target in the final syllable. While IPs may also end on a final H target, this 
final target is immediately preceded by another pitch target - the phrase accent - which 
is not found in ips. 

As seen in example (5), an ip may be quite long and span a number of words and 
even complete (subordinate) clauses. It is thus clearly larger than the units analysed 
as phonological words and accentual phrases in the literature. Instead of intermediary 


“This pattern is also found in Manado Malay, e.g. example 30 in Stoel (2007: 130). 

Examples from the corpora listed in the Sources section at the end are indexed for the recording and line 
they are taken from. Elision of syllables is common in natural Waima’a discourse. In (5), for example, the 
initial conditional clause ne kara data naha is shortened to ne katatona. The regularities of syllable elision 
and concomitant sound changes are, however, not yet understood. 
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Figure 9: F0 and waveform for example (5) 


phrase, the units under discussion could also be called prosodic phrases or phonological 
phrases.!Š All of these three terms are used in very different ways in the literature and it 
is not clear that the units thus labelled are actually instances of a common general type. 
Hence, intermediate phrase here specifically applies to the kinds of units defined at the 
beginning of this section. It is a matter for further research to determine whether these 
units have essential features in common with units referred to by the same label in other 
languages (English and Japanese, for example, as analysed in Beckman & Pierrehumbert 
1986). 

As a general rule, the size of ips is determined by syntax. That is, their boundaries 
usually match syntactic constituents such as NPs, VPs or clauses, with the possibility 
that heavy constituents such as NPs which include a relative clause are chunked into two 
ips. A fully worked out analysis of ip chunking is not yet available for any Austronesian 
language of Indonesia. The following patterns have been observed in the author's data 
for Totoli and Waima’a. 

It is quite common that the initial word in an IP is chunked as an ip if it is a question 
word, a conjunction, or an imperative marker. In (6), the initial word is the question 
word isei ‘who’, in (7) it is the conjunction tamba “because”, and in (8) it is the negative 
imperative marker deme’e ‘don’t’. For such relatively short ips it appears to be the rule 
that pitch rises continuously throughout the ip, as seen in the corresponding figures. 


léThe latter term is used by Stoel (2007) who uses a slightly different analytical framework but his phono- 
logical phrase clearly matches what is labelled intermediate phrase here. To wit: “There are two prosodic 
constituents that are particularly relevant for the description of Manado Malay intonation: the Phonolog- 
ical Phrase (PhP) and the Intonation Phrase (IP). The PhP is defined here as a prosodic constituent that 
begins and/or ends with an edge tone. The IP is defined as a prosodic constituent that contains one or more 
PhPs, but no more than one pitch accent [i.e. phrase accent in the terminology used here, NPH]. IPs do not 
have any associated edge tones. A PhP corresponds roughly to an XP at the syntactic level, and an IP to a 
clause. An IP may be followed by a short pause, while a PhP may not. It is characteristic for Manado Malay 
that the accent-bearing unit is a relatively high-level unit, whereas in many European languages, not only 
the IP, but also the PhP, may have more than one accent" (Stoel 2007: 121). 
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(6) 


(7) 
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Totoli (elicited) 


isei  nangaanko 


saginna 


who? AV.RLS:eat:AND banana:3s.POsS 
‘Who ate his/her banana?’ 


Wave 
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Figure 10: F0 and waveform for example (6) 


Waima’a (elicited) 


tamba ai-sa’i 


because yesterday 1s 


aku bira 
sick 


‘because I was sick yesterday’ 
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Figure 11: F0 and waveform for example (7) 
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(8) Waima'a (elicited) 
deme'e sike mala ne'i 
NEG.IMP touch box PRX 


‘Don’t touch this box.’ 
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Figure 12: F0 and waveform for example (8) 


Similarly, initial adverbial phrases such as pas la n'iki “right here' in (9) form their own 
ip. In this example, the pronominal subject and the verb (ne soke ‘he crashes’) also form 
an ip of their own. 


(9) Waima'a ([pearcarlito 101]) 
pas la nikinesoke la watu see 
right Loc here 3s crash Loc stone one 


‘and right then he crashes into a stone.’ 


Initial subject or topic NPs also tend to be phrased as ips. In (10), kii ba'an ke 'the old man' 
exemplifies this preference. As opposed to the preceding example, here the following VP 
is not phrased independently but forms one longish final phrase together with the local 
adjunct la kai-oo kai-oo ta ‘in the tree tops’. To date, the phrasing regularities for VPs 
and constituents following VPs are not yet well understood. 


(10) Waima'a ([pearcarlito_79]) 
kii ba'an ke uhu naga kai-wuola kai-oo kai-oo ta 
person HON-N DEM pick conr fruit Loc tree-top tree-top DIST 


‘the old man just keeps on picking fruits in the tree tops’ 


Finally, there is a strong tendency to phrase clauses separately in case a single IP contains 
more than one clause. This holds for subordinate clauses, as already exemplified with 
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Figure 13: F0 and waveform for example (9) 
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Figure 14: F0 and waveform for example (10) 


example (5). Example (11) shows that the length of subordinate and main clauses does 
not appear to play a major role in this regard, i.e. ip boundaries are inserted even when 
the overall IP is relatively short. 


(11)  Waima'a ([pearcarlito 103]) 
soke watu see ne loi 
crash stone one 3s fall 


‘crashing into a stone, he falls’ 
Two clauses may also be combined in a single IP if they are parallel in structure. In 


example (12), the parallelism is emphasized by the preverbal particle oo ‘too, as well’ in 
both clauses. 
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Figure 15: F0 and waveform for example (11) 


(12)  Waima'a ([pearcarlito 143]) 
wuo-telu ana oo laka ne oo laka 
cLr-three DIM too go 3s too go 


“the three of them walk off, (and) he also walks off 
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Figure 16: F0 and waveform for example (12) 


The preceding examples illustrate strong tendencies where it seems reasonable to ac- 
count for ip chunking in terms of syntactic structure. For these examples, it is not imme- 
diately obvious that information structure has a role to play. Nevertheless, it is also clear 
that a purely syntactic account will not suffice in all instances as none of these tenden- 
cies is actually obligatory (i.e. subordinate clauses do not have to be phrased as separate 
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ips, for example). It may thus very well turn out that some aspects of ip chunking are 
sensitive to information-structural factors. 

A case in point are VPs and constituents following them. For this syntactic config- 
uration, no clear syntactic tendencies have been detected so far (cf. examples (9) and 
(10) above). A particularly conspicuous example in this regard is example (5) where the 
final H$-boundary splits a complex PP/NP into two parts: la rihu ne’i ‘Loc this fog(’s)’ 
and wake nini ‘below poss’. The first part of this PP, which contains the preposed pos- 
sessor NP ‘this fog’, is chunked with the remainder of the clause (barse ne whaka ige 
‘it should fly’ (= modal particle + subject pronoun + verb + particle). The second part 
consists of what is formally the head of the complex NP, the possessum ‘below’ (more 
literally: ‘its underside’). Here, it may be speculated that ip chunking puts special em- 
phasis on ‘below’, as flying below the fog may have prevented the plane crash reported 
in the narrative from which this segment is taken. 

Another example for the possible influence of information structure on ip chunking 
comes from Manado Malay. Stoel (2007) observes the following possibility for prosodi- 
cally highlighting a word which does not occur in IP-final position. In Manado Malay, it 
is possible to add one, and exactly one, ip after the word carrying the phrase accent. This 
post-accentual ip is characterized by a compressed pitch range, usually being almost flat 
on a low tonal level, with the possibility of ending with a smallish final fall. Example 
(13) can be produced in the two different prosodic shapes presented in Figure 17 and 
Figure 18, respectively (both elicited). 


(13) Manado Malay verb focus (Stoel 2007: 126) 
dia da bamara pa Weni 
3s AsPangry at Weni 


‘She is angry at Weni.’ 


Note that in both instances, according to the analysis proposed by Stoel, the sentence is 
chunked into three ips, i.e. [dia] [da bamara] [pa Weni]. In Figure 17, the first two units 
are characterized by the rise from a low initial target to a H tone on the final syllable 
of the ip, as is typical for ips. In the third unit, pa Weni, there is a phrase accent on the 
penultimate syllable We, followed by a fall in the final syllable, ie. the typical pattern 
marking the end of an IP. Information-structurally this is a relatively neutral rendering, 
compatible with broad focus and object focus contexts. 

In Figure 18, on the other hand, the phrase accent occurs on the penultimate syllable 
of the second unit [da baMAra], followed by a fall which continues throughout the third 
unit [pa Weni]. The post-accentual ip is analysed by Stoel as involving only a single L 
boundary tone at the right edge, as opposed to the continuous L to H-rise typical for 
ips preceding the pitch accent. He calls it “encliticized” (Stoel 2007: 121), as it appears to 
contain (usually already) well-known information that is added to a unit which in itself 
is already complete. The rendering in Figure 18 is appropriate for contexts involving a 


"The following three figures are directly quoted from Stoel (2007), hence the difference in layout and anno- 
tation detail. No sound files were available to the present author. 
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Pitch (Hz) 


Time (s) 


Figure 17: Example (13) with unmarked prosodic phrasing (= Figure 5 in Stoel 
2007: 126) 


Pitch (Hz) 


0 1.52 
Time (s) 


Figure 18: Example (13) with verb focus (= Figure 6 in Stoel 2007: 127) 


narrow focus on the predicate. Example (14) illustrates narrow focus on the predicate 
from spontaneous speech. 


(14) Manado Malay verb focusStoel 2007: 126 
da orang cari pa ngana 
ASP person look.for at 2s 


‘Somebody was looking for you.’ 


Finally, there are examples where the lack of expected ip chunking appears to be influ- 
enced by information-structural considerations. This is attested in utterances where the 
utterance-final word carries contrastive focus as in she does not like RED, she likes GREEN. 
The Waima'a example in (15) illustrates. 
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Pitch (Hz) 


da o ray ca ri pa ja na 


0 1.23 
Time (s) 


Figure 19: Example (14) with verb focus (- Figure 8 in Stoel 2007: 128) 


(15) Waima'a (elicited) 
nede kara haru lumu 
38 NEG like shirt green 


‘S/he doesn't like the green shirt.’ 
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Figure 20: F0 and waveform for example (15) 


However, that the lack of ip chunking may be a way to convey contrastive focus on the 
word it appears on is only a conjecture that needs more testing and research. Note that 
even if this conjecture turns out to be true, it would not allow for a consistent marking 
of contrastively focussed items because the phrase accent is confined to the phrase-final 
word. Hence, in the current example it would not be possible to contrast the sHIRT green 
with the sKIRT green because the word order requires the adjective to follow the noun 
(cf. example (2) above). 
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To sum up this section, the intonational structure in Austronesian languages of Indone- 
sia and East Timor appears to be more complicated than suggested by Figure 4, because 
there is an additional level of intonational structure below the IP, i.e. the intermediate 
phrase. The regularities obtaining for ips are not yet well understood. This concerns both 
their tonal structure and the factors determining ip boundaries. The major tonal target 
in an ip is a H tone which consistently appears on the final syllable. What is not clear 
yet, is whether there is also an initial tonal target (in at least some of the languages in 
the region) and how to analyse the different trajectories for reaching the final H$ (con- 
tinuous rise throughout the ip vs. rise over the last few syllables of the ip vs. steep rise 
on the final syllable only). 

A further unresolved issue - briefly mentioned in connection with Figure 8 - is the 
question of how to analyse the (usually) final segment of an IP which follows the last 
H$ and contains the phrase accent and final boundary tone. Should this segment also be 
analysed as an ip (as the strict layer hypothesis Selkirk 1984 would demand)? But then, 
how can one explain the fact that tonal targets in this segment can be much more varied 
than in pre-final ips and obey different alignment regularities than the final boundary 
tone of ips? One option is to assume that ip-level tones are deleted at IP boundaries 
and overwritten by the higher-level IP edge tones, as proposed by Khan & Khan (2014: 
83) for Bengali (similar proposals have also been made for other languages, as pointed 
out by Khan & Khan 2014). However, it is not clear what kind of empirical evidence 
would support such an analysis. Furthermore, inasmuch as IP-level boundary tones are 
of a different type and do not include ip-level tones as a constituent (as in the model of 
Beckman & Pierrehumbert 1986, for example), this can still be seen as a violation of the 
strict layer hypothesis. 

As for boundary-determining factors, it is clear that ip boundaries generally obey ma- 
jor syntactic phrase boundaries such as NP, VP and PP. But exceptions occur, as seen in 
example (5). Furthermore, there appears to be a general tendency to phrase separately 
preverbal constituents of various types, including nominal and prepositional phrases 
and single word-constituents such as question words and conjunctions. The latter clearly 
show that constituent length is not a primary factor in ip chunking. There is also a clear 
tendency to phrase clauses separately if they occur in a single IP. No regularities for the 
VP and following constituents have been discovered so far. Similarly, it is not clear why 
expected ip boundaries are occasionally missing. It may well be the case that these types 
of examples involve information-structural influences on ip-phrasing. 


5 Where things get more complicated 2: Tone and stress 
(lexical prosody) 


In $3.1, it was noted that there is little or no evidence for word-prosodic distinctions 
in many Austronesian languages of Indonesia. In particular, there is little evidence for 


lexical stress, which is of primary concern here. The present section serves to briefly 
point out that, though comparatively rare, the Austronesian languages of Indonesia may 
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show considerable prosodic complexity on the word level, in particular in the eastern 
parts of Indonesia.? Furthermore, and more importantly, it seems that wherever there 
are word-prosodic distinctions they tend to be highly constrained in terms of (a) the 
position within the word or phrase, (b) the types of contrast allowed for, and (c) the 
interaction with other prosodic subsystems. 

Tonal contrasts have been reported primarily for a number of Austronesian West New 
Guinea languages (spoken in Indonesia's two easternmost provinces Papua and Papua 
Barat). Remijsen (2001) and Kamholz (2014: Chapter 5) provide succinct surveys as to 
what is known about tone in Austronesian Papua. The languages analysed so far show 
a broad variety of tonal systems. Magey Matbat, spoken on Misol, one of the Raja Ampat 
islands, is analysed by Remijsen (2007) as a syllable tone language with six different tones. 
A large part of the Magey Matbat vocabulary appears to be monosyllabic, but bi- and 
trisyllabic words also occur. All monosyllabic words bear tone. From the few examples 
provided in Remijsen (2007) it appears that at least one syllable in polysyllabic words is 
toneless, but the position of tone-bearing syllables is not predictable. This contrasts with 
Moor, a language spoken in southern Cenderawasih Bay, which is analysed by Kamholz 
(2014: 101-106) as disposing of four tonal patterns. Tonal marking in Moor is largely 
confined to the final two syllables. More importantly, and rather unusually for a tone 
language, "tones are realized only on phrase-final words" (Kamholz 2014: 102). It seems 
likely that tonal marking here interacts with the phrase-accent+boundary tone typical 
for IPs in Austronesian languages of Indonesia, a topic not addressed by Kamholz. 

A particularly complex - and cross-linguistically unusual - word-prosodic system is 
found in Ma’ya, the largest of the Raja Ampat languages. (Remijsen 2001; Remisen 2002) 
makes a convincing case for an analysis in terms of both lexical stress and lexical tone. 
There are three tonal contrasts which, however, are confined to the final syllable. In ad- 
dition, lexical bases differ in whether they are stressed on the penultimate or ultimate 
syllable. That is, there are minimal pairs which differ only with regard to tone, e.g. sa? 
‘to sweep’ vs. sa? ‘to climb’ vs. toneless sa ‘one’ (Remijsen 2002: 596). And there are 
minimal pairs differing only in stress, e.g. "mana? ‘light (of weight)’ vs. ma’na? ‘grease’ 
(Remijsen 2002: 600). Note that Remijsen (2002: 602-610) provides detailed acoustic ev- 
idence for the proposed stress difference, which includes not only duration measures, 
but also differences in vowel quality and spectral balance. 

Unfortunately, neither Remijsen nor Kamholz discuss postlexical prosody in the lan- 
guages they investigate. Hence it is unclear whether the word-level prosodies inter- 
act with postlexical tonal marking. Consequently, it is also unclear whether word-level 
prosodies have any role to play in conveying information-structural distinctions. 


18 This is not intended to be a comprehensive survey of word prosodies in Austronesian languages of Indone- 
sia, for which see van Zanten et al. (2010) (to be read with the caveats found in van Heuven & van Zanten 
2007b and Goedemans & van Zanten 2014). 

Tt is quite likely that there are more tone languages in this area than listed in Kamholz (2014). Among the 
Raja Empat languages, Ambel also has tone (Arnold 2017). In Yapen, current work by the author points to 
tone in Wooi. 
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6 Conclusion 


In this chapter, it has been argued that prosodic systems in the Austronesian languages 
of Indonesia work somewhat differently from what is known from West Germanic lan- 
guages (on which most current prosodic theory is based), and should be approached ac- 
cordingly. Most importantly perhaps, there is little evidence for lexical stress in many of 
these languages. And even if there are stress-like distinctions, it should not be presumed 
that these interact with the intonational system in a way similar to what has been found 
for Germanic languages (i.e. ‘stressed’ syllables do not necessarily serve as anchors for 
intonational tonal targets, recall Lindstróm & Remijsen's 2005 *a language where into- 
nation ignores stress"). Intonational targets appear to be placed with reference to the 
boundaries of prosodic units. Two types of units need to be distinguished, the higher- 
level Intonational Phrase (IP) and the lower level intermediate phrase (ip). Regularities 
for phrasing on both levels are not yet very well understood. Information-structural fac- 
tors such as focus and activation status may play a role here, but it is unlikely that they 
suffice for a full account of prosodic phrasing on either level. 
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Abbreviations 


Conventions in the examples: each line is one IP; (=) indicates latching; pause length is 
given in ( ); - marks truncated IPs; < > surround false starts. 


Glosses for grammatical categories: 


AND andative N phrase-final nasal in 
ASP aspectual particle Waima'a with attributive 
AV actor voice function 
CLF classifier NEG negation 
CONT continuative (aspect) P plural pronoun 
DEM demonstrative POSS possessive 
DET determiner PN personal name marker 
DIM diminutive PRX proximal (demonstrative) 
DIST distal (demonstrative) PTL particle 
HON honorific REL relative marker 
IMP imperative RLS realis 
LOC locative (preposition) S singular pronoun 
SBJ subject 
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In this paper, we look into the interaction between focus and prosody in Tagalog. In this 
language, for most focus conditions regular correspondences between syntax and informa- 
tion structure are observed: canonical constructions are used for sentence focus and pred- 
icate focus conditions, while pseudocleft constructions are used for argument focus condi- 
tions. However, some wh-questions, in particular targeting non-agent arguments, can be 
answered by means of canonical constructions as well as pseudocleft constructions. In this 
experimental study, we examine production data in order to test how Tagalog speakers 
prosodically distinguish canonical sentences associated with different focus structures. The 
results reveal that F0 cues and intensity consistently differentiate focused conditions from 
all-old utterances throughout the entire sentence. However, the distinct focus conditions 
are not prosodically differentiated. As for the argument focus condition, there may be dura- 
tional effects applying to the phrase in narrow focus, but this needs further confirmation. 


1 Mismatch between syntactic and focus structure 


Tagalog, an Austronesian language of the Philippines, has VSO word order, displaying 
VO word order correlates in a relatively consistent manner. Thus, in typical transitive 
clauses as in (1) a predicative verb appears in the clause-initial position, followed by 
arguments and adjuncts. Arguments and adjuncts are marked by either determiner-like 
case-markers or prepositions. In this paper, we refer to this type of verb-predicate clause 
as the canonical construction. 


(1) Kumain si Mama ng mami sa kusina. 
K<um>ain si-Mama nang-mami! sa- kusina. 


eat<AV>  P.NOM-Mama GEN=noodles Loc= kitchen 


“Mama ate noodles in the kitchen’ 
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Tagalog also has another construction type, where one of the arguments appears in the 
clause-initial position. We call this construction type a pseudocleft construction in the 
sense that it involves an equational clause structure with a wh-relative clause-like struc- 
ture.2 To illustrate, consider (2) and (3). 


(2) SiMama ang kumain ng mami. 
Si= Mama ang=[k<um>ain nang= mami]. 
P.NOM= Mama NOM=eat<AV>  GEN- noodles 


“The one who ate noodles is Mama. 


*Mama is the one who ate noodles: 


(3) Ang mami ang kinain ni Mama. 
(Ang=)mami ang-[k«in-ain ^ His Mama]. 
NOM-noodles NOM=eat<PV.PFV> P.GEN- Mama 


"What Mama ate is noodles: 


As seen in these examples, canonical and pseudocleft constructions share the same pro- 
positional content. A contrast between the two construction types lies in the focus as- 
signment patterns with which they are associated (Kaufman 2005, Nagaya 2007; see 
Lambrecht 1994 for the notion of focus structure used here). On the one hand, canonical 
constructions are employed for either sentence focus or predicate focus structures, see 
(4) and (5), respectively. 


(4 Q: Anong nangyari? 
Ano -'ng nang-yari? 
what =nom Av:PFv-happen 
“What happened?’ 


A: Kumain si Mama ng mami. 
K<um>ain si-Mama nang= mami. 
eat<AV>  P.NOM-Mama GEN= noodles 


‘Mama ate noodles: 


(5 Q: Anong ginawa ni Mama? 


Ano -'ng g«in-awa ni= Mama? 
what =NOM do<Pv:PFV> P.GEN= Mama 
“What did Mama do?’ 


A: Kumain siya ng mami. 
K«um-ain-siya | nang-mami. 
eat<AV>=3SG.NOM GEN-noodles 


“She ate noodles: 


!In the commonly-used Tagalog orthography, the diagraph ng represents a velar nasal /n/. An exception is 
the genitive case-marker for common nouns, which is pronounced as [nar] but spelled as ng. In this paper, 
however, it is presented as nang instead of ng for the sake of convenience. 

?See Kaufman (2009; 2018 [this volume]) for another view of this construction type. 
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On the other hand, pseudocleft constructions are employed for narrow focus or argu- 
ment focus, where the initial constituent of a clause is exclusively focused. In particular, 


this construction type is the only option in contrastive focus contexts. Example (6) illus- 
trates an explicit contrast. 


(69 A: K<um>ain=daw si=Maria nang= mami. 
eat<Av>=hearsay P.NoM=Maria GEN= noodles 
‘(They say) Maria ate noodles? 
B: Hindi. Si-Mama ang=[k<um>ain nang- mami]. 
NEG P.NOM=Mama NOM-eat«AV-  GEN- noodles 


*No. It is Mama (not Maria) who ate noodles: 


Not surprisingly, wh-questions must take the form of pseudocleft constructions, as in (7) 
and (8). Attention should be paid to the structural parallelism between (2)/(3) and (7)/(8). 


(7) Sino ang=[k<um>ain nang= mami]? [cf. (2)] 
who.NoM NOM-eat«AV- GEN= noodles 
"Who is the one who ate noodles?' 
"Who ate noodles?' 


(8) Ano ang=[k<in>ain ^ ni- Mama]? Jet (3)] 
what NoMzeat«PV:PFV» P.GEN- Mama 
"What is it that Mama ate?' 
"What did Mama eat?’ 


To summarize, in Tagalog, canonical constructions are used for predicate focus (hence- 
forth PF) and sentence focus (henceforth SF), while pseudocleft constructions are em- 


ployed for argument focus (henceforth AF). See Table 1 for a summary of these observa- 
tions. 


Table 1: Construction types and focus structures in Tagalog 


Construction type Focus structure Contexts 


Canonical construction Predicate Focus (PF) ‘What happened to X? 
‘What did X do?’ 
Sentence Focus (SF) "What happened?’ 
Pseudocleft construction Argument Focus (AF) ‘only’ 


focus of negation/correction 
wh-question 


However, the summary in Table 1 slightly overstates the regularity of the correspondence 
between syntactic and focus structure because questions targeting an argument do not 
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require a pseudocleft construction as the answer. Rather, a question such as "What did 
Mama eat?' allows for three types of answers, as seen in (9). 


(9 Q: Ano ang kinain ni Mama? [=(8)] 
Ano ang=[k<in>ain  ni= Mama]? 
what NoM-eat«PV:PFV» P.GEN- Mama 
"What is it that Mama ate?' 
"What did Mama eat?' 
A0: Mami. 
noodles 
‘Noodles: 


Al: Kumain siya ng mami. [Canonical] 
K«um-ain-siya nang= mami. 
eat<AV>=3SG.NOM GEN- noodles 


‘She ate noodles: 

A2: Mami ang kinain niya. [Pseudocleft] 
Mami  ang-k«in-»ain-niya. 
noodles NOM=eat<PV:PFV>=3SG.GEN 
"What she ate is noodles: 


That is, the question Ano ang kinain ni Mama? ‘What did Mama eat?’ can be answered 
with a pseudocleft construction in (9A2) as well as with a canonical construction in (9A1), 
despite the fact that here only one argument is in focus. In (9A1), then, we see a mismatch 
between syntactic and focus structure deviating from the regularities stated in Table 1. 


Note that such a mismatch is not possible when the agent NP is the target of a wh- 
question. Consider (10). 


(10) Q: Sinoang bumili ng mami? 

Sino ang= b<um>ili nang- mami? 
who NoM= buy<Av> GEN= noodles 
‘Who bought noodles?’ 

Al: Si Mama. 

A2: ??Bumili si Mama ng mami. [canonical] 
B<um>ili si= Mama nang= mami. 
buy<Av> P.NoM= Mama GEN= noodles 


‘Mama bought noodles’ 


A3: Si Mama ang bumili. [pseudocleft] 
Si- | Mamaang- b<um>ili. 
P.NOM Mama NoM= buy<AV> 


'It is Mama who bought noodles? 
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To answer a question targeting the agent, one can employ an agent NP by itself as in 
(10A1) or a pseudocleft construction as in (10A3). However, the use of a canonical con- 
struction in (10A2) is not felicitous. So, canonical constructions are only legitimate an- 
swers to argument questions if the argument asked for does not bear the agent role. 

With regard to the constructions where syntactic and focus structure do not prop- 
erly match the generalizations captured in Table 1, the question arises whether in such 
constructions the narrowly focused constituents differ prosodically from non-focused 
constituents. That is, do Tagalog speakers prosodically distinguish argument focus (9A1) 
from predicate focus (5A) in the canonical construction? 

In order to answer this question, we carried out a phonetic experiment. Our working 
hypothesis is that canonical constructions with different focus structures display the 
same syntax but with different prosodic cues, such as MaxFO0 and duration. To the best 
of our knowledge, the interaction between focus and prosody in Tagalog has not been 
well explored in experimental studies (cp. Kaufman 2005). Our study will be the first 
experimental research on this matter. 

The rest of this paper is organized as follows: in 82, we give a detailed description of 
the method employed for this experimental study. In 83, the results of the experiment 
and analyses of them are provided. $4 concludes this paper. 


2 Method 


In this experimental study, we look into the question of whether Tagalog speakers proso- 
dically distinguish canonical sentences associated with different focus structures. To in- 
vestigate this question, we make an acoustic comparison of the target sentence Bumili 
siya nang mami 'She bought noodles' in four different focus contexts: SF, PF, AF, and 
All-Old contexts (henceforth AO). See (11), (12), (13), and (14), respectively. 


(11) Q: Ano =’ng nang-yari? 
what =nom Av:PFv-happen 


"What happened?’ 

A: B<um>ili =siya nang= mami. 
buy<Av> =3SG.NOM GEN- noodles 
‘She bought noodles’ 


(12 Q: Ano -'ng g<in>awa His Mama? 
what -NoM do<Pv:PFV> P.GEN= Mama 


"What did Mama do?’ 


A: Bs<um>ili =siya nang= mami. 
buy<Av> =3SG.NOM GEN- noodles 


‘She bought noodles’ 
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(13 Q: Ano =’ng [b<in>ili ni= Mama]? 
what =Nom buy<PV:PFV> P.GEN= Mama 


"What did Mama buy?’ 


A: B<um-ili=siya nang= mami. 
buy<AV>=3SG.NOM GEN- noodles 


“She bought noodles: 
(14 Q: B«um-ili =ba si- Mama nang- mami? 
buy<Av> =Q P.NOM- Mama GEN- noodles 
“Did Mama buy noodles?' 
A: Oo, b<um>ili =siya nang= mami. 


yes buy<AV> =3SG.NOM GEN- noodles 


"Yes, she bought noodles: 


For this experiment, five male participants were recorded. See Table 2. All of them are 
college students in their twenties. They are native speakers of Tagalog but from different 
dialectal backgrounds: Quezon City (3), Rizal (1), Laguna (1). They also speak English as a 
second language. The recordings were made at the University of the Philippines, Diliman. 
A]l recording sessions were organized and supervised by the first author. A portable 
recorder (Zoom H5) with a head-mounted microphone (Shure Beta 54) was employed 
for the recordings. 


Table 2: List of participants 


Participant Hometown Gender Age 
Speaker 1 Laguna male 21 
Speaker 2 Quezon City male 20 
Speaker 3 Quezon City male 21 
Speaker 4 Rizal male 23 
Speaker 5 Quezon City male 25 


During the recording sessions, participants were asked to read the answers in a list of 
question-answer pairs. The four target pairs (SF, PF, AF and AO contexts) were randomly 
dispersed together with nine dummy pairs. See the Appendix for the complete list of 
question-answer pairs used for this experiment. Each participant repeated the whole list 
ten times. 

At the recording, each participant was instructed to exchange a conversation with 
another participant. More precisely, one participant asked the questions, and another 
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participant answered them.? Speaker 1 was paired with Speakers 2 and 3. Speakers 4 
and 5 were paired. Only answers were recorded. Before the actual recording session, 
participants were asked to practice by reading the two sets of sentences. 


3 Results and discussion 


3.1 Impressionistic comparison of pitch contours 


A total of 200 utterances (4 information status x 5 speakers x 10 repetitions) were ana- 
lyzed. In analyzing the data, prosodic word boundaries were manually marked on each 
utterance. The target sentence Bumili siya nang mami ‘She bought noodles’ was divided 
into three prosodic words: 


e bumili ‘bought’ (P) 
e siya ‘she’ (N) 
e nang mami ‘noodles’ (A) 


For impressionistic comparison of the pitch contours as a function of information sta- 
tus, time-normalized pitch tracks in semitone are plotted in Figure 1, averaging across 
all renditions by each speaker. Overall, the AO condition yielded lower F0s compared to 
all focused conditions across all speakers. In comparing different focus types, however, 
speakers exhibited slightly distinct patterns. As shown in the top-left panel of Figure 1, 
Speaker 1 produced the SF condition (dark solid line) with a slightly higher pitch than 
the other focus conditions, but no substantial difference was observed between PF (dot- 
ted line) and AF (dashed line) in terms of F0. On the other hand, Speaker 2 (top-right 
panel) exhibited somewhat higher F0 peaks of P and A in the PF condition (dotted line) 
than in the other focus conditions. The prosodic manifestation of information status of 
this particular speaker seems to be different from the other speakers in that the overall 
shapes of contours are quite distinct. Specifically, the contours of Speaker 2 in the PF 
and AO conditions show a different overall pattern from the ones found for AF or SF 
whereas those of the other speakers exhibit more or less similar overall contour shapes 
in all information conditions. Speaker 3 (mid-left panel) seems to be quite sensitive to the 
presence or absence of focus, but does not distinguish different types of focus; PF, SF and 
AF yielded nearly the same FO contours. Speaker 4 (mid-right panel) and Speaker 5 (bot- 
tom panel) produced SF and PF with a somewhat higher F0 than AF but no remarkable 
difference was found between information conditions. 


3We thank one of the reviewers who hinted at possible effects of convergence between two speakers (see 
Garrod & Pickering 2009; Kim et al. 2012; Gorisch et al. 2012) in this setting. However, it seems that such 
effects were not seriously large in our data because two speakers who exchanged conversations in the 
recording session exhibited quite different prosodic patterns. 

4p”, “N”, and “A” are labels for prosodic words. They are abbreviations of “predicate”, “nominative”, and 
“accusative”. But this does not imply that Tagalog has a nominative-accusative case system. 
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Speaker 1 Speaker 2 


94 


86 


semitone 


I" 


semitone 


semitone 


Figure 1: FO contours of each speaker in semitone: SF, AF, PF and AO are rep- 
resented by dark solid lines, dashed lines, dotted lines and light solid lines, 
respectively. 


3.2 Statistical analyses 


In order to compare prosodic characteristics of different information conditions, maxi- 
mum F0 (MaxF0), minimum F0 (MinF0), mean FO, mean intensity, and duration values 
of each prosodic word were extracted using the Praat script ProsodyPro (Xu 2013). 

For statistical analysis, linear mixed-effects analyses were conducted using JMP 9, with 
the speaker as random effects and information status as fixed effects. MaxF0, MinF0, 
meanF0, mean intensity, and duration were used as dependent measures. The analyses 
were performed separately for each phrase. All reported effects were significant at the p 
« 0.05 level. The results of our analyses are summarized in Table 3. 
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Table 3: Results of statistical analyses 


P (bumili) N (siya) A (nang mami) 
MaxF0 PF-SF-AF-AO  SF-PF-AF-AO  SF-PF-AF-AO 
MinF0 AF=PF=SF>AO  SF-PF-AF-AO  AF-PF-SF-AO 
mean F0  PF-SF-AF-AO  SF-PF-AF-AO  SF-PF-AF-AO 
intensity PF=AF=SF>AO  PF-AF-SF-AO  PF-AF-SF-AO 
duration PF=AF=AO=SF  AF-PF-AO-SF  AF-PF-SF-AO 


Prosodically, all the conditions show the same patterns for P (bumili) and N (siya); 
in these parts of the sentence, all the focus conditions were realized with significantly 
higher F0 and greater intensity than the AO condition while different types of focus 
were not prosodically differentiated. Interestingly, duration was not significantly differ- 
ent among the four information conditions. 

Similar results are observed in the A phrase (nang mami). Focus conditions yielded 
highest FO, greater intensity, and longer duration compared to the AO condition. How- 
ever, the four conditions did not differ significantly with respect to the acoustic measure- 
ments. Unlike the P phrase (bumili) and the N phrase (siya), this phrase was realized with 
longer duration when it received focus. It is conceivable this is an effect of narrow focus. 
Yet, further investigation involving more speakers and material would be necessary to 
confirm this effect. 


3.3 Discussion 


The results of our analyses reveal two important facts about the interaction between 
focus and prosody in Tagalog. First, it was observed that F0 and intensity consistently 
differentiated focused conditions from AO. This observation was also confirmed by the 
statistical analyses. Second, no significant prosodic differences were observed between 
the distinct focus constructions. 

A general problem for these conclusions, however, pertains to the fact that the into- 
national contours of the target sentences vary from speaker to speaker to an extent that 
needs further explanation. The five speakers did utter the same sentence but with quite 
different contours, as demonstrated in Figure 1. The pitch contours of Speaker 4 may 
appear to be reasonably similar to that of Speaker 5 to be considered minor variants of 
the same overall pattern. But the similarities between the remaining contours are less 
easily amenable to a single underlying melody. It is not clear yet how to account for 
this variation among Tagalog speakers. Dialectal differences could be one factor to con- 
sider. However, the prosodic characteristics of different dialects in Tagalog are next to 
unknown, so this has to remain a speculation at this point. Further, Speakers 2, 3, and 
5 produced noticeably different patterns though they are from the same region. Thus, it 
seems that this large between-speaker variation cannot be attributed solely to dialectal 
differences. 
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4 Conclusion 


In this paper, we presented a preliminary experimental phonetic analysis of the inter- 
action between focus and prosody in Tagalog. In particular, we highlighted mismatch- 
ing patterns between syntax and information structure found in question-answer pairs. 
Some wh-questions, specifically ones targeting non-agent arguments, can be answered 
by means of canonical constructions as well as pseudocleft constructions, despite the fact 
that for most focus conditions Tagalog displays regular correspondences between syn- 
tax and information structure: canonical constructions are used for SF and PF conditions, 
while pseudocleft constructions are used for AF conditions. 

Our working hypothesis was that there might be prosodic cues to distinguish canoni- 
cal constructions associated with different focus structures. The results of our production 
study reveal that F0 cues and intensity consistently differentiate focused conditions from 
all-old utterances throughout the entire sentence. As for the argument focus condition, 
there may be durational effects applying to the phrase in narrow focus, but this needs 
further confirmation. 
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Abbreviations 

AV actor voice PFV perfective 

cv circumstantial voice PROS prospective 
DUP reduplication PV patient voice 
GEN genitive SG singular 

GER gerund PL plural 

IPFV imperfective 1 first person 
LK linker 2 second person 
LOC locative 3 third person 
LV locative voice “<>” infix 

NEG negator = cliticization 
NOM nominative s reduplication 
P personal name 
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Appendix: Target sentences 


Four target sentences and nine dummy sentences were employed in this experiment. 
Below is the list of the target and filler sentences: sentences (16), (21), (23), and (27) are 
targets (highlighted in bold so that they can be spotted more easily), while the others 
function as fillers. In the recording sessions, the entire list was repeated ten times. The 
participants were asked to read these sentences in this order. Only the parts in italics 
were presented to the participants (i.e., no morphological analyses, interlinear glossing, 
or translations). 


(15) Q: Saan ka pupunta? 
Saan -ka pu-punta? 
where =2SG.NOM AV:PROS:g0 
“Where are you going?’ 
A: Sa Ministop ako pupunta. 
Sa= Ministop =ako pu~punta. 
LOC- Ministop =1SG.NOM AV:PROS:gO 


‘I am going to a Ministop’ 


(16) Q: Anong binili ni Mama? 


Ano = "g b<in>ili ni= Mama? 
what =NOM Pv:Prv:buy P.GEN= Mama 
"What did Mama buy?' 

A: Bumili siya ng mami. 
B«um»ili =siya nang- mami. 
Av:buy =3SG.NOM GEN= noodles 
‘She bought noodles: 


(17 Q: Sino ang bumili ng mami? 
Sino ang= b<um>ili nang- mami? 
who NoM- Av:buy GEN- noodles 
"Who bought noodles?' 
A: Si Mama ang bumili. 
Si- Mama ang= b<um>ili. 
P.NOM Mama NoM- Av:buy 


‘It is Mama who bought noodles: 


(18) Q: Ano pa binili ni Mama? 
Ano -pa (=ang) b<in>ili ni= Mama? 
what =else =NOM PV:PFV:buy P.GEN= Mama 


‘What else did Mama buy?’ 
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A: Mami lang ang binili niya. 
Mami =lang ang= b<in>ili  =niya. 
noodles =only NoM- Pv:PFV:buy =3SG.GEN 


‘She bought only noodles: 


(19 Q: Saan ka pumunta? 
Saan -ka p<um>unta? 
where =2sG.NOM AV:go 
"Where did you go?' 
A: Pumunta ako sa Ministop. 
P<um>unta -ako sa- Ministop 
AV:PFV:gO =1SG.NOM LOC- Ministop 


‘I went to Ministop’ 


(20) Q: Mani ba ang kinain niya? 
Mani | -baang- k<in>ain -niya? 
peanuts =Q NOM= PV:PFV:eat =3SG.GEN 
"Did she eat peanuts?' 

A: Hindi. Mami ang kinain niya. 

Hindi. Mami ang= k<in>ain -niya. 
NEG noodles NOM= PV:PFV:eat =3SG.GEN 
‘No. She ate noodles: 


(21) Q: Anong ginawa ni Mama doon? 
Ano -'ng g«in-awani- Mama doon? 
what =NOM Pv:Prv:do P.GEN Mama there 
"What did Mama do there?’ 
A: Bumili siya ng mami. 
B«um-» ili =siya nang- mami. 
Av:buy =3SG.NOM GEN= noodles 
‘She bought noodles: 


(22) Q: Anong paborito mong pagkain? 
Ano -ng paborito mo=ng pagkian? 
what =NoM favorite 25G.GEN-LK food 
"What is your favorite food?’ 


A: Paborito ko ang mami. 
Paborito -ko ang- mami 
favorite =1SG.GEN NOM= noodles 


‘Noodles are my favorite’ 
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(23) 


(24) 


(25) 


(26) 


(27) 


: Anong nangyari? 


Ano =’ng nang-yari? 
what =NOM Av:Prv:happen 


"What happened? 

: Bumili siya ng mami. 
B«um»ili =siya nang- mami. 
Av:buy =3SG.NOM GEN- noodles 
‘She bought noodles: 


: Sino ang bumili ng mami? 


Sino ang= b<um>ili nang= mami? 
who NoM- Av:buy GEN= noodles 


‘Who bought noodles?’ 


: Bumili si Mama ng mami. 


B<um>ili si- Mama nang- mami? 
Av:buy  r.NoM- Mama GEN= noodles 


"Mama bought noodles: 


: Anong binili ni Mama? 


Ano = "g b<in>ili ni= Mama? 
what =NOM Pv:Prv:buy P.GEN= Mama 
"What did Mama buy?' 

: Mami ang binili niya. 
Mami  ang- b«in-ili ^ -niya. 
noodles NOM= Pv:PFV:buy =3SG.GEN 
‘She bought noodles: 


: Masarap ba ang mami nila? 


Ma-sarap =ba ang= mami =nila? 
ADJ-delicious =Q NoM- noodles =3PL.GEN 


"Are their noodles delicious?’ 


: Oo. Masarap ang mami nila. 


Oo Ma-sarap | ang- mami  -nila. 
yes ADJ-delicous NoM= noodles =3PL.GEN 


‘Yes, their noodles are delicious: 


: Bumili ba si Mama ng mami? 


B<um>ili =ba si= Mama nang= mami? 
Av:buy =Q P.NOM= Mama GEN= noodles 
‘Did Mama buy noodles?’ 
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A: Oo, bumili siya ng mami. 
Oo, b<um>ili =siya nang= mami. 
yes Av:buy =3SG.NOM GEN= noodles 


"Yes, she bought noodles: 
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This paper reports the results of two perception experiments on the prosody of Papuan 
Malay. We investigated how native Papuan Malay listeners perceive prosodic prominences 
on the one hand, and boundaries on the other, following the Rapid Prosody Transcription 
method as sketched in Cole & Shattuck-Hufnagel (2016). Inter-rater agreement between the 
participants was shown to be much lower for prosodic prominences than for boundaries. 
Importantly, however, the acoustic cues for prominences and boundaries largely overlap. 
Hence, one could claim that inasmuch as prominence is perceived at all in Papuan Malay, it 
is perceived at boundaries, making it doubtful whether prosodic prominence can be usefully 
distinguished from boundary marking in this language. Our results thus essentially confirm 
the results found for Standard Indonesian by Goedemans & van Zanten (2007) and vari- 
ous claims regarding the production of other local varieties of Malay; namely, that Malayic 
varieties appear to lack stress (i.e. lexical stress as well as post-lexical pitch accents). 
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1 Introduction 


Papuan Malay (henceforth PM) is a local variety of Indonesian/Malay, spoken in the 
two easternmost provinces of Indonesia - Papua Barat and Papua - by approximately 
1,200,000 speakers (see Kluge 2014). It is spoken mostly in the coastal areas, and to a 
lesser extent in the mountainous inland. Indonesian Papua, with its more than 270 in- 
digenous languages, is linguistically highly diverse, and most speakers are at least bilin- 
gual. Papuan Malay serves as the lingua franca in this area, and most native speakers 
speak PM in addition to one or more local languages. 

This paper reports on two perception experiments that investigate the contribution 
of prosody with respect to how native speakers of PM perceive prosodic prominences 
and boundaries in natural speech. It thus stands alongside a growing number of recent 
papers that discuss the prosodic systems of different varieties of Indonesian, such as, 
for example, the study by Goedemans & van Zanten (2007) on Standard Indonesian and, 
most recently, the paper by Maskikit-Essed & Gussenhoven (2016) on Ambonese Malay. 

For along time, the standard assumption has been that (Standard) Indonesian displays 
lexical stress on the penultimate syllable, unless this syllable contains a schwa, in which 
case stress falls on the final syllable (cf. Alieva et al. 1991; Cohn 1989). Secondary stress 
has been claimed to fall on the first syllable and every odd syllable thereafter, but never 
on the one adjacent to the syllable that carries the main stress (Cohn & McCarthy 1994). 
Other authors have claimed that schwa in (some varieties of) Indonesian can be stressed 
just as well as any other vowel (Halim 1974; Laksman 1994). 

However, in a growing number of studies, the claim that Indonesian displays lexical 
stress on the penultimate syllable has been challenged. While some authors found that 
there is a preference for stress to occur on the penultimate syllable but free variation — 
especially in longer words - is possible (cf. van Zanten 1994; van Zanten & van Heuven 
2004), other authors came to the conclusion that there is no lexical stress at all (Zubkova 
1966; Odé 1997). Especially in more recent publications, it has been pointed out that the 
aforementioned disagreement as to whether or not Indonesian displays lexical stress is 
probably due to the fact that "Indonesian" as a reasonably homogeneous language does 
not exist. Around 700 indigenous languages are spoken in the Republic of Indonesia (cf. 
Simons & Fennig 2018), with the great majority of people being at least bilingual, speak- 
ing a local language in addition to (some variety of) Indonesian. Often, Indonesian is 
learned as a second language, usually from the age of six or seven, when children enter 
primary school and are exposed to Indonesian as the language of education. Further- 
more, in addition to standard Indonesian and the indigenous languages, local varieties 
of Malay are spoken in many regions of Indonesia (e.g. Ambonese Malay, Jambi Malay, 
Kupang Malay, Manado Malay, Papuan Malay, etc.). Often, these local varieties of Malay 
take the place of standard Indonesian and are the major means of everyday communi- 
cation. It is thus very likely that studies on lexical stress in "Indonesian" are based on 
data from speakers with different substrate dialects and languages, which means that 
the contradictory results of such studies are probably due to the different prosodic prop- 
erties of these substrates. More recent studies therefore make an effort to control for 
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the linguistic background of the participants in their experiments. Yet even these more 
recent studies provide results that are not straightforward to interpret, an assessment 
which is also valid for our study, as further detailed in $5. This is in part due to the 
fact that more recent studies - even if they control for substrate influence - continue to 
have issues regarding the naturalness of the word tokens under investigation (often loan 
words four syllables in length or even longer) and adequate sampling. Many studies rely 
on non-natural lab speech, often produced by a single speaker, and evaluated by only a 
few more. 

Goedemans & van Zanten (2007), for example, conducted a carefully designed percep- 
tion experiment with two groups of participants with different linguistic profiles: one 
group consisted of speakers of Indonesian with Javanese as their substrate language, the 
other group consisted of speakers of Indonesian who were additionally native speak- 
ers of Toba Batak. These two languages were chosen because Toba Batak is said to 
exhibit clearly defined stress, while Javanese is said to have only weak stress, the lo- 
cation of which lacks consensus in the literature (Goedemans & van Zanten 2007: 40). 
As stimuli, the authors recorded material from one Toba Batak Indonesian speaker and 
one Javanese Indonesian speaker. This material was manipulated so that presumably 
prominence-lending phonetic cues, i.e. pitch excursions, duration and intensity, would 
occur on different syllables. It was then judged for acceptability by listeners of the two 
different groups. The Javanese listeners did not show any preference for stress on either 
the penultimate or the ultimate syllable for both the Javanese Indonesian and the Toba 
Batak Indonesian stimuli. The Toba Batak listeners, on the other hand, clearly preferred 
penultimate stress in the Toba Batak speech data, but showed no clear preferences for the 
Javanese data. Goedemans & van Zanten interpret these results as evidence against lexi- 
cal stress in Javanese Indonesian. Though their experiment was explicitly not designed to 
investigate prominence above the word level, they do observe that phrasal prominence 
always occurs close to the boundary. They they come to the conclusion that "the dis- 
tinction between accent lending and boundary marking intonation movements is very 
difficult to make in Indonesian" (Goedemans & van Zanten 2007: 57). 

One of the few studies that address the issue of phrasal prominence in more detail is 
the work by Maskikit-Essed & Gussenhoven (2016) on Ambonese Malay (see also Stoel 
2007 on Manado Malay, Himmelmann 2010 on Waima'a, and Clynes & Deterding 2011 
on Brunei Malay). Maskikit-Essed & Gussenhoven conducted a production experiment 
with four native speakers of Ambonese Malay. They recorded 80 mini-dialogs consisting 
of read question-answer pairs, which contained eight target nouns in different positions 
(phrase- and IP-final as well as phrase- and IP-medial) and were controlled for different 
focus conditions. In these eight target words, no evidence for (post-)lexical stress in the 
putative stressed syllables (ultimate or penultimate, depending on the word) was found. 
Furthermore, the phrase-final pitch movement, which is a typical feature of declarative 
mood in many languages in the area (Himmelmann 2010: 67), is not tied to either the 
final or the prefinal syllable. Rather, it is sensitive to the available space and tends to 
be timed earlier when the word is longer. Finally, Maskikit-Essed & Gussenhoven tested 
two focus conditions, one in which the phrase-final target word was in focus, and one 
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in which it occurred in post-focal position, i.e. a focal element preceded the phrase-final 
target word. In the latter condition, the authors could not find any signs of reduction of 
the post-focal target words, either in duration or in pitch height. Furthermore, the pitch 
contours were similar, not only on the target words, but also over the whole sentences 
(Maskikit-Essed & Gussenhoven 2016: 372). Taking these results together, Maskikit-Essed 
& Gussenhoven come to the conclusion that information focus in Ambonese Malay is 
not expressed by means of prosody. 

For Papuan Malay, Kluge (2014) recorded 1,072 words in two different carrier sentences, 
one in which the target word occurs clause-finally, and one in which it appears in clause- 
medial position.! Kluge concludes that 964 (90%) of all words have penultimate stress 
(including both open and closed penultimate syllables), and only 108 (10%) show stress on 
the final syllable. Of those 108 words that displayed ultimate stress, 105 (97%) contained 
the front open-mid vowel /e/ (the equivalent of Indonesian schwa) in the penultimate 
syllable. Yet, Kluge notes that /e/ does not condition ultimate stress, since 65 of those 
words with penultimate stress, the stressed syllable also contained an /e/. In addition, 
three words with ultimate stress contained /i/ and /u/ vowels in the penultimate syllable 
(Kluge 2014: 89). 

Based on this analysis, Papuan Malay would appear to be very similar to Ambonese 
Malay as presented in the grammar by van Minde (1997), where it is claimed that Am- 
bonese Malay has regular penultimate stress, with a small number of lexical items show- 
ing ultimate stress. Note that in both grammars, the analysis is based primarily on the 
auditory impression of the Western researcher who hears one or the other syllable as 
more prominent. It is unclear what Ambon and Papuan Malay native speakers actually 
hear. The present study is a first exploration of this question. Recall from above that 
Maskikit-Essed & Gussenhoven (2016) did not find clear acoustic evidence for (lexical) 
stress or (post-lexical) pitch accents in Ambonese Malay. Hence, it may very well be the 
case that Western ears tend to hear these languages according to the categories they 
know from their own prosodic systems, and not necessarily based on the ‘objectively’ 
available acoustic input. That is, if Maskikit-Essed & Gussenhoven's (2016) findings hold 
up to further scrutiny, the phrase-final pitch movement in Ambon Malay that is heard by 
Western researchers as being clearly located on either the penultimate or the ultimate 
syllable is actually most often (i.e. in terms of the measurable acoustic cues) located 
somewhere in between the final two syllables and thus, strictly speaking, is not properly 
anchored to either one, but rather to the phrase-final boundary. 

In targeting perception rather than production, the current study takes up the line 
of research pioneered by Leiden phoneticians in the 1990s, though with a somewhat 
different methodology (see the book edited by van Heuven & van Zanten 2007 for a 
summary). With regard to these studies, Papuan Malay would appear to be most simi- 
lar to Toba Batak, for which a system with predominantly penultimate and occasional 
ultimate stress has also been reported, though possibly with a higher functional load, as 


1The two carrier sentences Kluge used are: Sa blum taw ko pu kata itu, kata xxx. ‘I don't yet know that word 
of yours, the word xxx’ and Ko pu kata xxx itu, sa blum taw. ‘Your word xxx, I don’t know yet. (Kluge 2014: 
57). 
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a fair number of minimal stress pairs are claimed to exist (Roosman 2007: 92ff provides 
a succinct summary of the literature). Unfortunately, Roosman (2007) does not investi- 
gate prominence perception by Toba Batak speakers of their native tongue. Moreover, 
the work by Goedemans & van Zanten (2007) discussed above only looks at the percep- 
tion of different varieties of Indonesian by native Toba Batak speakers. Hence, the results 
here will not be directly comparable with the results reported by the Leiden group. It will 
nevertheless begin to sketch out one of the constellations not yet investigated in detail, 
i.e. the native perception of a prosodic system which - to Western ears - appears to have 
a fairly clear lexical stress system with predominantly penultimate stress. 

In concluding these introductory remarks, it bears emphasizing that although much 
of the literature — and consequently also parts of this introduction — makes reference 
to phonological categories, including in particular "(lexical) stress", such categories only 
make sense as part of a comprehensive analysis of the prosodic system of a given lan- 
guage. Since such an analysis does not yet exist for Papuan Malay, the main purpose of 
the current chapter is to provide perceptual data for a more comprehensive investigation 
ofthe Papuan Malay system, which in addition will require a rigorous and detailed acous- 
tic analysis, a task currently being undertaken by one of the authors (Himmelmann). 

The present chapter is structured as follows: $2 describes the experimental setup and 
methods, before 83 and 84 report on the results of the two experiments (on prominences 
and boundaries, respectively). 85 summarizes the findings and draws some preliminary 
conclusions on the interrelation between the perception of prosodic cues and their in- 
terpretation by native listeners of Papuan Malay. 


2 Methods 


Given the growing amount of evidence in the literature to support the assumption that 
the prosodic systems of different varieties of Malay differ significantly from the better- 
known European systems, we wanted to address the question of how native speakers 
of one of these varieties - Papuan Malay - interpret prosodic cues if required to judge 
the presence or absence of prominences and boundaries. We therefore conducted two 
perception experiments using the Rapid Prosody Transcription (RPT) method, as intro- 
duced in different papers by Jennifer Cole and colleagues (cf. Mo et al. 2008; Cole, Mo 
& Hasegawa-Johnson 2010; Cole, Mo & Baek 2010; Cole & Shattuck-Hufnagel 2016: 7- 
13). In the RPT setup, ordinary listeners who are naive with respect to prosodic analysis 
listen to excerpts of audio recordings. They are given minimal instructions (see below) 
and are allowed to play the audio recordings only twice. On a printed transcript of the 
recorded excerpts, in which punctuation and capitalization have been removed, the par- 
ticipants are either asked to underline those words which they perceive as prominent 
(prominence experiment), or to draw a vertical line after the word which they perceive 
to be the last word of a prosodic unit (boundary experiment). 

The advantage of this method is its simplicity and directness, providing us with coarse- 
grained linguistic data: prosodic judgments by untrained listeners, which are based on 
the listeners' holistic perception of form and function. As noted by Cole and colleagues, 
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the prominence and boundary judgments elicited in this task are clearly not based ex- 
clusively on prosodic factors, but also include morphosyntactic, semantic and pragmatic 
factors. Our main concerns here are prosodic factors, but some of our variables (for exam- 
ple, the distinction between content and function words) also target these other levels. 


2.4 Subjects 


The raters of our perception study were 44 native speakers of Papuan Malay (22 for the 
prominence experiment, 22 for the boundary experiment). Of the 22 participants in the 
prominence experiment, 15 were female. 15 were bilingual in Papuan Malay and standard 
Indonesian, and 7 participants were additionally proficient in another local language. Of 
the 22 participants in the boundary experiment, 12 were female. 17 subjects were speak- 
ers of Papuan Malay and standard Indonesian, and 5 spoke another local language. All 
44 participants were students at the Universitas Papua (UNIPA) in Manokwari (West 
Papua), and between 18 and 28 years of age. All participants stated that Papuan Malay 
was (one of) their first language(s)"and that Papuan Malay was their first language of 
communication at home and at university, as well as when talking to friends. None of 
them had any experience in prosodic analysis or reported any hearing or reading prob- 
lems. 


2.2 Stimuli and procedure 


The participants annotated 56 excerpts of audio recordings (the same for both the promi- 
nence and the boundary experiment). These excerpts were taken from a corpus of natural 
speech, consisting of speakers re-telling Chafe’s Pear Movie (Chafe 1980) and playing the 
Tangram Task.’ Excerpts thus consisted of both monologues (the pear movie recordings) 
and dialogues (the tangram recordings). They were of varying lengths, ranging from 1 
to 15 seconds, and included 28 different native speakers of Papuan Malay (17 female, 11 
male). 

Instructions for the participants of the experiments were, as stated above, minimal. 
They included a short written description of what we mean by prominence and bound- 
aries, respectively. For the prominences, it was explicitly stated that underlining more 
than one word per excerpt was allowed. No audio examples were given, but both instruc- 
tions contained a written example that illustrated how to mark either prominences or 
boundaries, and how choices could be corrected, if necessary (see Appendix A for the 
original instructions in Indonesian, and Appendix B for English translations). 

The data in (1) show an example of one of the excerpts, including glosses and transla- 
tion (1a), and how it was presented to the participants of the experiment (1b). (1c) shows 


“Four further participants that took part in the prominence experiment were excluded from the results 
because they had learned Papuan Malay only at a later age when they entered primary school. They were 
therefore not considered native speakers, even if they had lived in Manokwari for several years and their 
dominant language was Papuan Malay at the time of the experiment. 

3The Tangram Task is an elicitation method that involves two speakers negotiating whether the picture 
described by speaker one is the same as the picture given to speaker two. 
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the prominence choices made by one of the participants (RW, female, 23 years), (1d) in- 
dicates the boundary positions perceived by another participant (JGL, female, 25 years). 


(1) Papuan Malay 
a. yang tiga orang ini pegang topi satu 
REL three person DEM carry hat one 
‘The three people are carrying a hat? 
b. yang tiga orang ini pegang topi satu 


c. yang tiga orang ini pegang topi satu 
d. yang / tiga orang ini / pegang topi satu 


2.3 Test variables 


We tested the influence on the native listeners’ judgments of a number of prosodic and 
morphosyntactic cues which have been found to have an effect on prominence or bound- 
ary perception in other (generally West Germanic) languages. For each test word in both 
experiments, we investigated the following prosodic factors: word duration (in ms), mean 
duration of syllables (in ms), duration of the last syllable within a word (in ms), minimum, 
maximum and mean pitch (in Hz), absolute pitch range (in semitones), number of syl- 
lables (both abstract phonological and actually realized) as well as presence of a pause. 
An increase in duration, pitch height and pitch range have been shown in many studies 
to correlate with higher perceived prominence in Germanic languages (e.g. Cole, Mo & 
Hasegawa-Johnson 2010; Rietveld & Gussenhoven 1985), while presence of a pause and 
domain-final lengthening has been shown to trigger the perception of a phrase break 
(e.g. Turk & Shattuck-Hufnagel 2007). 

Furthermore, we analyzed the morphosyntactic cues part-of-speech (POS), part- of- 
speech class (i.e. content words vs. function words), whether the word is the last verbal 
argument in the excerpt, and syntactic break (three levels: no, weak or strong break). 
The label weak break was assigned to sentence-medial words that were followed by a 
subordinate clause (e.g. relative clause), while the label strong break was assigned to 
sentence-final words. Again, all these structural factors were chosen from a European 
point of view, since West Germanic languages are known to be sensitive to these param- 
eters. In English and German, function words are usually less prominent than content 
words (Büring 2012: 31), while the last verbal argument in a sentence is of importance 
when it comes to focus projection, i.e. in the default intonation of a broad focus sentence, 
the last verbal argument receives the nuclear accent (Uhmann 1988: 66). 

In addition to these linguistic factors, we correlated the experiment’s outcome with 
an expert rating of prosodic boundaries, which represents the consensus judgments of 
the authors, all of them German natives. Boundaries in this version are based on the 
consensus of at least three of the four authors. In a pre-test, this expert rating was sta- 
tistically analyzed with the same factors examined for the native raters, showing strong 
influences of pause, overall word duration, mean syllable duration and syntactic struc- 
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ture. The effect of syntactic structure is somewhat surprising, as two of the authors do 
not know the language and thus have no understanding of the syntax. 


2.4 Data analysis 


Both experiments consisted of binary classification tasks. In the prominence experiment 
(Experiment I), participants had a binary choice for each word in the transcript to rate 
it as either prominent or non-prominent. In the boundary experiment (Experiment II), 
there was a choice for each consecutive pair of words to either place a boundary between 
them or not. That is, for an excerpt containing n words, there were (n - 1) consecutive 
word pairs and thus (n - 1) potential boundaries the rater had to decide upon, since no 
judgment was needed after the last word of an excerpt. Given that our set of 56 excerpts 
consisted of 730 words altogether, each participant thus produced 730 data points in the 
prominence experiment and 674 data points in the boundary experiment. 

For the statistical analysis of these data, a mixed effects logistic regression was per- 
formed using the Ime4-package (Bates et al. 2015) in R (R Core Team 2015), which suits 
both continuous and categorical input variables. As this study is exploratory in nature, 
we only created single effect models (e.g. only maximum pitch or part-of-speech, but not 
both variables) with random effects for speaker, sentence and rater. Subsequently, odds 
ratios were calculated to enable a comparison of the factors by means of effect size in 
order to determine which cue had the strongest influence on the raters' judgments. 

We further calculated both the Fleiss' kappa coefficient (plus its z-normalized score) 
and Cohen's kappa. Fleiss' kappa provides a single coefficient as a measure of agreement 
across all raters. Cohen's kappa calculates agreement between an individual pair of raters 
for each word/consecutive pair of words, comparing the labels (i.e. prominent - non- 
prominent, and boundary - no-boundary, respectively). 

In addition, we calculated the prominence-score (p-score) and the boundary-score (b- 
score), which serve as relative measures representing the ratio of subjects that under- 
lined a word, i.e. that perceived a word as prominent, or drew a vertical line, i.e. per- 
ceived a prosodic break, with respect to the total number of participants. An example 
showing p- and b-scores is given in Figure 1. 


3 Results of the prominence experiment 


3.1 Inter-rater and multi-rater agreement 


As mentioned above, we measured the overall inter-rater agreement for both experi- 
ments by calculating Fleiss' and Cohen's kappa coefficients. These two measures allow 
us to compare the performance of the two rater groups between the two experiments. 
They also make it possible to compare our results with similar studies that used RPT 
to investigate native speakers' perception of prominences in American English and in 
German. 
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yang  tiga  orang ini  pegang topi satu 


Figure 1: P- and b-scores for one PM excerpt (cf. (1) above). The higher the value, 
the more participants perceived a word as prominent (gray line with triangles) 
or perceived a boundary after the respective word (black line with squares). 
Recall that no b-score has been calculated for the last word of an excerpt. 


The Fleiss' kappa score we calculated for the prominence experiment amounts to 
0.103 (z = 42.1), a value that turns out to be surprisingly low in cross-linguistic compari- 
son. In Table 1, we compare the PM inter-rater scores from the prominence experiment 
with those of two comparable studies on American English (Mo et al. 2008; Cole, Mo 
& Hasegawa-Johnson 2010) and German (Baumann & Winter to appear). The study by 
Cole and colleagues used spontaneous conversational speech from the Buckeye Corpus, 
which consists of interviews with adult speakers of American English from Columbus 
(Pitt et al. 2007). Baumann & Winter's study, on the other hand, used read sentences that 
displayed different focus structures and information status categories. Both made use of 
the RPT method as described above. The comparison clearly shows that the PM listeners 
perform significantly worse in the prominence task than English or German listeners. 


Table 1: Fleiss’ kappa for prominences in German, American English, and PM 
rating studies 


PM 
German English 
Fleiss' Kappa 0.53 0.42 0.103 
Z 244 20.4 42.1 


The slightly higher agreement of German raters compared to English raters is probably 
due to the different data types used in the respective experiments, i.e. read speech versus 
spontaneous conversational data. Considering that the naturalness of the stimuli might 
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have an effect on the raters' level of agreement in their perception of prominences, the 
PM scores are probably best compared with the English scores. Still, the difference be- 
tween English raters, with a Fleiss' kappa score of 0.42, and Papuan Malay, with a kappa 
score of only 0.103, is also striking. 

To test whether the low score of the PM raters in the prominence experiment was just 
due to very low agreement between some individual participants, we calculated Cohen's 
kappa scores for every single rater pair. In Table 2, the pair-wise inter-rater agreement 
is summarized, using the agreement categories postulated by Landis & Koch (1977), who 
characterize kappa values between 0-0.20 as slight agreement, 0.21-0.40 as fair, 0.41-0.60 
as moderate, 0.61-0.80 as substantial, and 0.81-1 as (almost) perfect agreement. 


Table 2: Inter-rater agreement categories (based on Cohen's kappa scores) for 
PM subjects in the prominence experiment 


Prominences 
inter-rater agreement pairs percentage 
none 25 10.82% 
slight 164 71.00% 
fair 40 17.32% 
moderate 2 0.87% 
substantial 0 0.00% 
(almost) perfect 0 0.00% 

231 100% 


As we can see, more than 80% of the pairs showed either ‘slight’ or no agreement, and 
for only about 17% of pairs was the agreement ‘fair’. The picture gained by the Fleiss’ 
kappa study is thus confirmed. As we will see in §4.1, both the multi-rater agreement 
and the pair-wise inter-rater agreement in the prominence experiment is much lower 
than in the boundary experiment. 


3.2 Factors determining perceived prominence 


As already indicated by the low kappa values above, we observed a high degree of vari- 
ability in the listeners’ judgments, leading to predominantly low p-scores. In fact, the 
modal value in our data was a p-score of 13.6%, as shown in Figure 2. There was not a 
single item (out of 726+) that all raters considered prominent, the highest p-score be- 
ing 81.8% (18 out of 22 participants agreeing on assigning prominence to a given word), 
which was achieved only three times. Furthermore, there were only twenty words which 
all participants judged as not prominent (out of 726 words altogether). 

When examining which of the 14 test variables influenced the perception of promi- 
nence, only part-of-speech was not found to have a significant effect on prominence 


Four items had to be discarded because no pitch features could be calculated. 
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Figure 2: Distribution of p-scores in the PM data 


judgments (X2(1) = 0.6444, p = 0.4221). Note, however, that the actual effect sizes of the 
various factors were found to be rather small, as indicated by the odds ratios. An odds 
ratio of 1 usually indicates that there is no change in the odds of receiving a certain 
outcome when manipulating the test variable. An odds ratio bigger than 1 indicates an 
increase in the odds of getting a certain outcome (cf. Field et al. 2012: 320, 923), in our 
case a prominence response. We have excluded variables with extremely small odds ra- 
tios from further consideration, in order to concentrate on those effects that are most 
likely to have noticeable effects on prominence judgments. Our threshold was set to an 
odds ratio of 1.5 to 1. This procedure led to the exclusion of all measures relating to pitch 
(maximum, minimum, mean pitch and pitch range) in addition to part-of-speech, number 
of syllables (phonological) and duration of the last syllable. 

The strongest effect was found for pause (X?(1) = 156.26, p < 0.0001), increasing the 

odds of observing a prominence response in the presence of a pause as opposed to the 
absence of a pause by 2.7 to 1 (logit estimate: 1.01, SE = 0.08). Figure 3 shows the relation 
between prominence judgments on a word and a subsequent pause. 
The second most influential factor for the perception of prominence by native speakers of 
Papuan Malay was part-of-speech class (X?(1) = 329.3, p < 0.0001), i.e. content vs. function 
word, as displayed in Figure 4. Being presented with a content word as opposed to a 
function word increases the odds of observing a positive response for prominence by 2.1 
to 1 (logit estimate: 0.73, SE = 0.04). 

As a third factor, overall word duration had an impact on the prominence ratings 
(X?(1) = 857.16, p < 0.0001). In this continuous parameter, a change by one standard 
deviation increases the odds of a prominence response by 1.9 to 1 (logit estimate: 0.62, 
SE - 0.02). Figure 5 shows this effect as a tendency of longer words to reach a higher 
p-score, i.e. the longer the word, the more participants marked it as prominent. 

The remaining four test variables were found to be more marginally relevant and clearly 
overlap with one of the three preceding variables. Thus, mean syllable duration and num- 
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Figure 3: P-scores as a function of presence or absence of a subsequent pause 
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Figure 4: P-scores as a function of part-of-speech class 


ber of syllables (actually realized) - both with odds ratios of 1.6 to 1 - are obviously related 
to word duration. Similarly, syntactic break (odds ratio 1.5 to 1) and last verbal argument 
(odds ratio 1.6 to 1) often overlap with pauses. 


4 Results of the boundary experiment 


4.1 Inter-rater and multi-rater agreement 


The first result to note with regard to inter-rater agreement is that our participants per- 
formed much better in the boundary experiment (Experiment II) than they did in the 
prominence experiment (Experiment I). That is, inter-rater agreement was much higher 
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Figure 5: P-scores as a function of word duration 


in the former than in the latter. Table 3 repeats the Fleiss’ kappa scores for the promi- 
nence experiment (cf. §3.1) and contrasts them with the scores for the boundary experi- 


ment. 
Table 3: Fleiss’ kappa scores for prominences and boundaries in PM 
Prominences Boundaries 
0.103 0.407 
z = 42.1 z = 160 


Comparing the boundary scores again with Mo et al.'s (2008) RPT results for Ameri- 
can English, we see that — in contrast to the prominence scores - English listeners and 
Papuan Malay listeners are not too far apart in their perception of boundaries: 0.544 for 
American English vs. 0.407 for Papuan Malay. 

As with the prominence experiment, we additionally looked at the pair-wise inter-rater 
agreement. Table 4 summarizes the Cohen's kappa values by using the agreement cat- 
egories of Landis & Koch (1977). Compared with the results of the prominence experi- 
ment (repeated in the second and third columns of Table 4), we see a clear difference 
between the two experiments: while in the prominence experiment more than 80% of 
all rater pairs showed either no or only slight agreement, only about 18% of the rater 
pairs showed such low agreement in the boundary experiment. Instead, the majority of 
pairs who participated in the boundary experiment (more than 60%) showed moderate 


or even substantial agreement. 
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Table 4: Inter-rater agreement categories (based on Cohen's kappa scores) for 
PM subjects in both experiments 


Prominences Boundaries 
inter-rater agreement pairs percentage pairs percentage 
none 25 10.82% 4 1.73% 
slight 164 71.00% 37 16.02% 
fair 40 17.32% 51 22.08% 
moderate 2 0.87% 106 45.89% 
substantial 0 0.00% 33 14.29% 
(almost) perfect 0 0.00% 0 0.00% 

231 100% 231 100% 


4.2 Factors determining perceived boundaries 


As we have seen in the previous section, the overall agreement of raters is better in the 
boundary experiment than in the prominence experiment. This is reflected in Figure 6, 
where we observe a modal value of 0, which is to be expected as there are usually many 
more word pairs with no boundaries between them than ones where the two words are 
separated by a boundary. We can also observe a longer tail to the right, indicating that 
higher scores are reached than in the prominence experiment. That is, the participants 
agreed more on the position of boundaries than on the position of prominences (top 
scores: 95.5% as opposed to 81.8%). However, even though the agreement among raters 
is higher for boundaries than for prominences, complete agreement (on the presence of 
a boundary) is never achieved. 

When correlating the multiple possible factors introduced in §2 with the outcome, 
the only variable that does not reach significance is part-of-speech class (X2(1) = 0.7962, 
p = 0.3722). In the same way as with the prominence results, however, we will concen- 
trate only on the strongest effects, indicated by odds ratios bigger than 1.5 to 1. The vari- 
ables not considered further are the morphosyntactic parameters part-of-speech and last 
argument, and the duration/syllable number measures duration of last syllable, number of 
syllables (phonological) and number of syllables (actually realized). This also includes two 
of the pitch measures, i.e. minimum and mean pitch, but note that the other two pitch 
measures (maximum pitch and pitch range) are also only marginally effective (odds ratio 
1.6 to 1 for pitch range and odds ratio 1.7 to 1 for maximum pitch). 

The most significant factor affecting the perception of a boundary in this experiment 
was the presence of a pause (X2(1) = 1519, p < 0.0001). As illustrated in Figure 7, the pres- 
ence of a pause in contrast to a non-interrupted signal increased the odds of a positive 
response for boundary by 22.9 to 1 (logit estimate: 3.13, SE = 0.09). 

Although much weaker, another major effect on the perception of boundaries was found 
in the syntactic structure of the utterances (X7(1) = 1514.2, p < 0.0001). As Figure 8 indi- 
cates, the type of syntactic break influences the perception of a boundary. Thus, a change 
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Figure 6: Distribution of b-scores in the PM data 
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of one unit increases the odds of observing a boundary response by 3.4 to 1 (logit esti- 
mate: 1.23, SE = 0.03). The effect size can be explained by the amount of variability shown 
in the plot and the overly coarse values for this parameter. Thus, there are quite a few in- 
stances where participants agreed on the presence of a boundary even though there was 
no major (clausal) syntactic break. Such boundaries typically involve a clause-internal 
syntactic break such as the right edge of a topic or subject NP. Recall that the syntactic 
break parameter only distinguishes subordinate clause and sentence boundaries from 
no boundary (= all syntactic boundaries within a clause). Furthermore, participants did 
not always agree on perceiving a prosodic boundary at sentence boundaries (= strong 
syntactic breaks), which in part is due to the fact that sentence boundaries are often not 
easy to determine in spontaneous discourse. The high variability, especially in cases of 
a strong syntactic break, leads to a relatively small effect of syntax, although the mean 
values of the two categories weak and strong break are far apart from each other. 
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Figure 8: B-score as a function of syntactic structure 


Next to pause and syntax, the mean duration of syllables was found to be the third 
most important factor for the perception of boundaries, but the effect here is relatively 
weak (X?(1) = 1415.1, p < 0.0001; see Figure 9). Thus, a change of one standard devia- 
tion increases the odds of getting a boundary response by 2.6 to 1 (logit estimate: 0.95, 
SE - 0.029). 

We found almost the same effect size for the parameter word duration (X?(1) = 1423.7, 
p « 0.0001), where a change of one standard deviation increases the odds of observing a 
boundary response by 2.5 to 1 (logit estimate: 0.90, SE = 0.028). 


5 Discussion 


If we compare the results gained in the two RPT-experiments, we find a high degree 
of variability for prominence judgments, but less variability for boundary judgments. 
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Figure 9: B-score as a function of mean syllable duration 


The lack of agreement with regard to prominence judgments is reflected in overall low 
Kappa values and low p-scores (best p-score achieved: 81.8%; modal value: 13.6%). The 
considerably stronger agreement in the perception of prosodic boundaries is shown by 
much higher Kappa values and more consistent b-scores at both ends of the scale (best 
b-score achieved: 95.5%; modal value: 0%). 

When correlating the native judgments of Experiment I and II with the various pa- 
rameters that might affect the perception of prominences and boundaries, respectively, 
we observe an interesting pattern (see Table 5): the two prosodic factors most important 
in influencing prominence and boundary ratings are basically the same; namely, pause 
and word duration/mean syllable duration? Apart from the considerable difference in 
effect size with regard to the parameter pause, the major difference between the two 
experiments pertains to the non-prosodic factor found to be most influential for the rel- 
evant judgment. Part-of-speech class was found to be a relevant cue for prominence but 
not for boundaries. This is not surprising, as content words are generally claimed to be 
more prominent than function words, due to their higher semantic weight or structural 
strength (see Büring 2012) and to their (commonly) lower word frequency (see Cole, 
Mo & Hasegawa-Johnson 2010). In contrast, syntactic structure becomes more important 
when it is the participants' task to judge the position of boundaries. This, again, is in line 
with findings for other languages (see, for example, Cole, Mo & Baek 2010). 

Table 5 lists the three most important factors determining prominence and boundary 
judgments in descending order. Importantly, and somewhat surprisingly from a Euro- 
pean perspective, the relevant prosodic factors are not only (almost) the same across 


5For boundaries, word duration is the fourth most influential parameter with an odds ratio of 2.5, and mean 
syllable duration is the third-most effective cue, with an odds ratio of 2.6. As this difference is extremely 
small, we regard these two factors as equally effective with regard to prosodic boundaries. In the case of 
prominences, the difference between word duration (odds ratio of 1.9) and mean syllable duration (odds ratio 
1.6) is somewhat more pronounced, but still not very large. 
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both experiments, but the ranking is also the same, i.e. pause in first and word duration/ 
mean syllable duration in third position. 


Table 5: Major effects for both experiments in terms of their effect size (odds 


ratio -OR) 
Experiment I: Prominences Experiment II: Boundaries 
OR OR 
pause 27 pause 22.9 
part-of-speech class 21 syntactic structure 3.4 
: 19 mean syllable duration /word du- 2.6/2.5 
word duration iato 


When comparing the odds ratios, it is obvious that the effect sizes for the prominence- 
lending parameters are smaller than their counterparts for boundary perception. The 
rather small effect sizes are linked to the very high degree of variability in the promi- 
nence ratings (low Kappa values). That is, the effects these parameters may have on 
prominence judgments clearly do not lead to substantial agreement with regard to these 
judgments. In fact, the high degree of variability raises the question of whether the no- 
tion of prominence makes any sense to PM speakers, a point we will return to below. 

For boundaries, by contrast, the effects seem to be more robust. Furthermore, the vari- 
ability observed here is within the range of variability observed for other languages (see 
84.1 above). Major phonetic cues for prosodic boundaries are pauses and longer word 
and syllable durations, which are widely attested cross-linguistically (see, for example, 
Turk & Shattuck-Hufnagel 2007). In fact, with regard to boundary perception, native 
hearing and Western auditory analysis appear to be quite similar, as revealed by the 
comparison of the expert rating by the four authors (cf. $2) and native listeners' judg- 
ments in Figure 10. In most cases in which experts did not perceive a prosodic boundary, 
the native raters also tended towards the perception of no boundary, which is indicated 
in the plot by larger dots for lower b-scores. However, in some cases where non-native 
experts did not perceive a boundary, there was a considerable agreement among na- 
tive raters that they perceived a boundary. The opposite pattern can be observed for 
instances in which the experts did perceive a boundary: there are fewer instances of low 
b-scores but (slightly) more instances of higher b-scores. Statistically, this pattern is mir- 
rored by a strong correlation for the perception of boundaries between the two groups 
(X?(1) = 2949.9, p < 0.0001). When the experts observed a boundary in contrast to no 
boundary, the odds of a boundary response by the native listeners increased by 26.8 to 
1 (logit estimate: 3.3, SE - 0.07), which is higher than the strongest factor influencing 
native speakers' boundary judgments (i.e. pause with an odds ratio of 22.9). 

Given the very weak inter-rater agreement results for prominence judgments and 
the fact that the same prosodic cues appear to play a role in judging prominences and 
boundaries, we tentatively conclude that the perception of prominence is to some extent 
conflated with the — more clearly conceptualized - perception of (prosodic) boundaries 
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Figure 10: Correlation between non-native (German) experts’ and native (PM) 
listeners’ boundary perception (indicated by binary scores and b-scores, respec- 
tively). The size of the dots indicates the number of compatible observations 
(the more observations, the larger the dot). 


in PM. This conclusion is in line with similar observations quoted from the literature in 
81, which also raise doubts as to the feasibility of separating prosodic prominences from 
prosodic boundaries in other Malayic varieties. 

It should be noted, however, that there is no perfect match between prominences and 
boundaries in that natively perceived boundaries are not reliable predictors for promi- 
nences. Testing the effect of b-scores on p-scores, we found an odds ratio of only 1.1 to 
1, although the likelihood-ratio test revealed significance. The small effect size is mainly 
due to the fact that substantially more prominences were marked than boundaries. Re- 
call from 83.2 that only 20 of the 726 words (i.e. 2.75%) occurring in the test items were 
unanimously judged to lack prominence. In contrast, of the 674 non-final words in the 
test utterances, 212 were unanimously judged not to precede a boundary (i.e. 31.45%). 

Perhaps the most surprising result of our preliminary exploration is the fact that pitch- 
related parameters do not appear to play a role for PM speakers in judging prominences 
and boundaries. Recall from $3.2 and $4.2 that only maximum and mean pitch were found 
to be marginally effective in the case of boundary judgments, but well below the more 
effective parameters pause, syntactic break, and mean syllable/word duration. This find- 
ing is particularly relevant because the claims of Western researchers regarding lexical 
stress differences in Malayic varieties appear to be primarily based on differences in pitch 
alignment, with high pitch targets being heard as located on either the penultimate or 
the ultimate syllable of a word. The production study by Maskikit-Essed & Gussenhoven 
(2016) for Ambonese Malay already questioned whether there is in fact a clear alignment 
of pitch targets with respect to syllable boundaries. Our study suggests that, although 
modulations of pitch are clearly present (acoustically as well as perceptually to the West- 
ern ear), these do not appear to play a role either in the perception of boundaries or in 
the marking of prosodic prominences in PM - and possibly other Malayic varieties. 
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In fact, prosodic prominence may not be a relevant category in PM and other Malayic 
varieties in general, which would thus represent further instances of what has been 
termed stress deafness (see e.g. Peperkamp & Dupoux 2002, Dupoux et al. 2010 for French). 
However, our results are not directly comparable with this line of work as the methods 
used quite clearly differ. It is also far from clear whether stress deafness is a homoge- 
neous phenomenon. Hence, it may turn out that the French and PM cases only partially 
overlap, if at all. 

We need more data to answer the question of whether PM listeners are really insen- 
sitive to prominence-lending pitch modulations. This includes the further question of 
whether they do not respond to pitch modulations at all, i.e. also when rating languages 
that are known to primarily use pitch in the marking of prosodic prominence (a study pre- 
senting German stimuli to Papuan Malay listeners is currently under way). If we were to 
find higher prominence scores when PM listeners rate German data, the present results 
would only support the conclusion that pitch modulations are not systematically em- 
ployed in prominence marking in PM, thus confirming similar findings in the literature 
reported in 81. If prominence scores by PM listeners prove to be similar across different 
languages, this would suggest a more general account in terms of stress deafness for PM 
listeners. 

We would also like to add a cautionary note regarding the notion of “(post-) lexical 
stress" as it has been used in this chapter and in much of the previous descriptive and 
experimental literature. Inasmuch as ‘stress’ is understood to be a phenomenon that per- 
tains to the phonologically organized highlighting of a syllable relative to adjacent ones 
by way of modulating phonetic parameters such as pitch and duration, the current study 
supports the conclusions of earlier studies that lexical stress is not part of the prosodic 
system of Malayic varieties. As pointed out in the introduction, the cases of Papuan and 
Ambon Malay are particularly interesting in this regard, because pitch modulations here 
appear to be - both acoustically and perceptually to the Western ear - very regular and 
clearly anchored to different syllables (penultimate or final), unlike in the Indonesian 
spoken by Javanese native speakers, where pitch modulations are much more variable. 

In this context, it should be noted that it is very well possible that in PM pitch targets 
are clearly aligned with syllables, in contrast to Maskikit-Essed & Gussenhoven's (2016) 
claims for Ambonese Malay. If this were to be the case, we would need a stress-like no- 
tion to be able to account for differing alignments of pitch targets with penultimate and 
final syllables which, however, would differ from the standard understanding of "lexical 
stress", as this distinction does not appear to be perceived as a prominence distinction 
by native speakers. 

While it thus seems very likely that prosodic prominence is organized differently in 
these languages, a number of phenomena may still need to be accounted for in stress- 
related metrical categories. To give just one more example, Kluge (2014) makes the oc- 
casional reference to stress distinctions in discussing segmental alternations in PM. An 
example is the observation that /s/ is only palatalized in unstressed syllables (Kluge 
2014: 73). If one denies lexical stress distinctions in the standard sense given above, one 
needs to identify another factor that adequately constrains the palatalization rule. Fur- 
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thermore, lexical stress in the sense of phonologically organized prominence distinctions 
is of course not the only possible prosodic organization at word level. Foot structure, for 
example, may be evident in terms of phenomena not directly reflected in phonetic differ- 
ences. Thus, it should be clearly understood that denying the existence of lexical stress 
in these languages does not mean that there is no word-prosodic organization at all. 
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Appendix A 


Instructions for Experiment I (Prominences) 


Pertama-tama kami mengucapkan terima kasih karena Anda bersedia berpartisipasi da- 
lam eksperimen tentang bagaimana Anda memahami bahasa. Jawaban yang Anda beri- 
kan tidak ada yang salah atau benar karena semuanya bergantung pada rasa bahasa. 

Dalam berbicara seseorang akan mengucapkan beberapa atau banyak kata dalam se- 
buah kalimat dengan nada yang lebih menonjol dibandingkan dengan kata-kata lain 
yang terdapat dalam kalimat tersebut. Kata-kata dengan nada yang menonjol ini bia- 
sanya dapat dirasakan oleh pendengarnya. Tugas Anda adalah menandai (menggaris- 
bawahi) kata-kata yang nadanya Anda dengar lebih menonjol dibandingkan dengan 
kata-kata lain dalam rekaman kalimat yang akan Anda putar. 

Berikut ini Anda akan diputarkan 56 kalimat. Setiap kalimat juga akan disajikan dalam 
bentuk tertulis. Untuk mulai silakan klik Contoh 1, dst. 

Tugas Anda adalah menggarisbawahi semua kata yang nadanya Anda anggap lebih 
menonjol (mis. lebih tinggi) dibandingkan dengan kata-kata lain pada setiap rekaman 
kalimat yang Anda dengarkan. Silakan garis bawahi kata tersebut dengan cara seperti 
ini: 
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Dia melihat sapi 


Dalam hal ini, Anda dimungkinkan untuk memilih lebih dari satu kata pada setiap reka- 
man kalimat! 


Dia melihat sapi dan kuda makan rumput 


Anda dapat memutar setiap rekaman kalimat sebanyak dua kali. Akan tetapi, tidak me- 
mungkinkan untuk menghentikan rekaman pada saat contoh kalimat sedang diputar. 

Jika Anda harus mengoreksi pilihan Anda, silakan coret kata yang telah Anda garis 
bawahi dengan cara seperti ini: 


Sapi 


Selamat mengikuti eksperimen ini! 


Instructions for Experiment II (Boundaries) 


Ketika seseorang berbicara, dia akan membagi ucapan mereka menjadi potongan-po- 
tongan. Potongan-potongan tersebut membentuk kelompok kata-kata yang memudah- 
kan pendengar untuk memahami ucapan pembicara. Potongan-potongan tersebut pent- 
ing terutama saat pembicara memproduksi ucapan yang panjang. 

Contoh potongan yang mungkin Anda ketahui adalah potongan nomor ketika Anda 
memberi tahu nomor telepon Anda kepada orang lain. Biasanya, Anda tidak setiap kali 
memberi satu nomor (0, 8, 1, 3 ...), tetapi Anda akan memotong nomor hp tersebut men- 
jadi kelompok-kelompok yang terdiri atas dua, tiga, atau empat angka (081, 358, 772 ...). 

Untuk rekaman yang akan Anda dengar, Anda diminta untuk menandai potongan 
dengan cara menyisipkan garis tegak lurus atau vertikal (pada cetakan) untuk bagian 
yang Anda dengar sebagai satu potongan. Batas antara dua potongan tidak harus sama 
dengan lokasi tempat Anda akan menulis tanda koma, titik, atau tanda baca lainnya. Jadi, 
Anda harus benar-benar hati-hati mendengar ujaran dan tandai batas yang Anda dengar 
sebagai akhir sebuah potongan. 

Sebuah potongan mungkin saja berupa satu kata, atau mungkin terdiri atas beberapa 
kata, dan ukuran (jumlah kata) dalam setiap potongan dari para pembicara bisa saja 
berbeda-beda dalam satu ujaran. Beberapa ujaran mungkin Anda dengar konsisten, yaitu 
terdiri atas satu potongan saja. Jika demikian, Anda tidak perlu menandai batas potong- 
an. 

Contoh: 


081[358|772... 
0813[5877]2... 
Bapak saya | sudah datang 
Bapak | saya sudah datang 
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Appendix B 


Instructions for Experiment I (Prominences) 


First of all, we want to say thank you for participating in our experiment on how people 
perceive language. There is no right or wrong answer - we are just interested in your 
innate sense of language P 

When talking, people will stress or emphasize some words within a sentence more 
than others. These stressed words can usually be perceived by the hearer. Your task in this 
experiment is to point out (underline) all words that you perceive to be more emphasized 
compared to the rest of the utterance in the recordings that we will play to you. 

You will hear 56 sentences. You will also receive each sentence as a written transcript. 
To start, please click Example 1, and so on. 

Your task is to underline all words that you perceive to stick out (e.g. because they are 
higher/louder) compared to the other words in each recording that you will hear. Please 
underline your choice in the following way: 


He sees a cow 
It is possible to choose more than one word for each recording! 
He sees a cow and a horse eating grass 


You can play each recording twice. It will not be possible to stop the recording while it 


is playing. 
If you want to make a correction to your choice, please cross out the underlined word. 


SE 


Enjoy the experiment! 


Instructions for Experiment II (Boundaries) 


When people speak, they chunk their utterances into units. These chunks of words help 
the hearer to understand the utterance. They are especially important if the speaker 
produces longer, coherent speech. 

An example you might be familiar with is the chunking of digits when giving some- 
body your telephone number. Instead of spelling one digit after another (0, 8, 1, 3 ...), it 
is common to divide the number into units consisting of two, three, or four digits each 
(081, 358, 772 ...). 

For the recordings you will hear, you are asked to mark those chunks by inserting a 
vertical line (on the printout) to divide what you perceive to be a unit. The boundary 
between two chunks does not necessarily have to coincide with where one would write 


Note that the English transations are free rather than literal translations of the Indonesian original. 
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a comma, a full stop or any other punctuation, so please listen carefully and draw the 
line where you hear the end of one unit. 

One unit might consist of one word only, or it can contain several words - the size ofa 
unit might vary from utterance to utterance. Some recordings might consist of one unit 
only. If this is the case, you don't have to draw a boundary. 

Examples: 


081[358|772... 
0813[5877]2... 

let's eat grandpa 
let's eat | grandpa 
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Perspectives on information 
structure in Austronesian languages 


Information structure is a relatively new field to linguistics and has only recently been studied 
for smaller and less described languages. This book is the first of its kind that brings together con- 
tributions on information structure in Austronesian languages. Current approaches from formal 
semantics, discourse studies, and intonational phonology are brought together with language 
specific and cross-linguistic expertise of Austronesian languages. The 13 chapters in this vol- 
ume cover all subgroups of the large Austronesian family, including Formosan, Central Malayo- 
Polynesian, South Halmahera-West New Guinea, and Oceanic. The major focus, though, lies on 
Western Malayo-Polynesian languages. Some chapters investigate two of the largest languages 
in the region (Tagalog and different varieties of Malay), others study information-structural phe- 
nomena in small, underdescribed languages. The three overarching topics that are covered in 
this book are NP marking and reference tracking devices, syntactic structures and information- 
structural categories, and the interaction of information structure and prosody. Various data 
types build the basis for the different studies compiled in this book. Some chapters investigate 
written texts, such as modern novels (cf. Djenar's chapter on modern, standard Indonesian), or 
compare different text genres, such as, for example, oral narratives and translations of biblical 
narratives (cf. De Busser's chapter on Bunun). Most contributions, however, study natural spoken 
speech and make use of spoken corpora which have been compiled by the authors themselves. 
The volume comprises a number of different methods and theoretical frameworks. Two chap- 
ters make use of the Question Under Discussion approach, developed in formal semantics (cf. 
the chapters by Latrouite & Riester; Shiohara & Riester). Riesberg et al. apply the recently de- 
veloped method of Rapid Prosody Transcription (RPT) to investigate native speakers' perception 
of prosodic prominences and boundaries in Papuan Malay. Other papers discuss theoretical con- 
sequences of their findings. Thus, for example, Himmelmann takes apart the most widespread 
framework for intonational phonology (ToBI) and argues that the analysis of Indonesian lan- 
guages requires much simpler assumptions than the ones underlying the standard model. Arka 
& Sedeng ask the question how fine-grained information structure space should be conceptual- 
ized and modelled, e.g. in LFG. Schnell argues that elements that could be analysed as "topic" 
and "focus" categories, should better be described in terms of ‘packaging’ and do not necessarily 
reflect any pragmatic roles in the first place. 
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